Preface



This is a textbook on electricity and magnetism, designed for an undergraduate course at 
the junior or senior level. It can be covered comfortably in two semesters, maybe even 
with room to spare for special topics (AC circuits, numerical methods, plasma physics, 
transmission lines, antenna theory, etc.). A one-semester course could reasonably stop
after Chapter 7. Unlike quantum mechanics or thermal physics (for example), there is a 
fairly general consensus with respect to the teaching of electrodynamics; the subjects to 
be included, and even their order of presentation, are not particularly controversial, and 
textbooks differ mainly in style and tone. My approach is perhaps less formal than most; I 
think this makes difficult ideas more interesting and accessible. 

For the third edition I have made a large number of small changes, in the interests of 
clarity and grace. I have also modified some notation to avoid inconsistencies or ambiguities. 
Thus the Cartesian unit vectors î, ĵ, and k̂ have been replaced with x̂, ŷ, and ẑ, so that all
vectors are bold, and all unit vectors inherit the letter of the corresponding coordinate. 
(This also frees up k to be the propagation vector for electromagnetic waves.) It has always 
bothered me to use the same letter r for the spherical coordinate (distance from the origin) 
and the cylindrical coordinate (distance from the z axis). A common alternative for the 
latter is ρ, but that has more important business in electrodynamics, and after an exhaustive
search I settled on the underemployed letter s; I hope this unorthodox usage will not be 
confusing. 

Some readers have urged me to abandon the script letter 𝓻 (the vector from a source point
r' to the field point r) in favor of the more explicit r - r'. But this makes many equations
distractingly cumbersome, especially when the unit vector 𝓻̂ is involved. I know from my
own teaching experience that unwary students are tempted to read 𝓻 as r; it certainly makes
the integrals easier! I have inserted a section in Chapter 1 explaining this notation, and I
hope that will help. If you are a student, please take note: 𝓻 = r - r', which is not the same
as r. If you’re a teacher, please warn your students to pay close attention to the meaning of
𝓻. I think it’s good notation, but it does have to be handled with care.

The main structural change is that I have removed the conservation laws and potentials 
from Chapter 7, creating two new short chapters (8 and 10). This should more smoothly 
accommodate one-semester courses, and it gives a tighter focus to Chapter 7. 

I have added some problems and examples (and removed a few that were not effective). 
And I have included more references to the accessible literature (particularly the American 
Journal of Physics). I realize, of course, that most readers will not have the time or inclination
to consult these resources, but I think it is worthwhile anyway, if only to emphasize
that electrodynamics, notwithstanding its venerable age, is very much alive, and intriguing 
new discoveries are being made all the time. I hope that occasionally a problem will pique 
your curiosity, and you will be inspired to look up the reference — some of them are real 
gems. 

As in the previous editions, I distinguish two kinds of problems. Some have a specific 
pedagogical purpose, and should be worked immediately after reading the section to which 
they pertain; these I have placed at the pertinent point within the chapter. (In a few cases 
the solution to a problem is used later in the text; these are indicated by a bullet (•) in the 
left margin.) Longer problems, or those of a more general nature, will be found at the end 
of each chapter. When I teach the subject I assign some of these, and work a few of them 
in class. Unusually challenging problems are flagged by an exclamation point (!) in the 
margin. Many readers have asked that the answers to problems be provided at the back 
of the book; unfortunately, just as many are strenuously opposed. I have compromised, 
supplying answers when this seems particularly appropriate. A complete solution manual 
is available (to instructors) from the publisher. 

I have benefitted from the comments of many colleagues — I cannot list them all here. 
But I would like to thank the following people for suggestions that contributed specifically 
to the third edition: Burton Brody (Bard), Steven Grimes (Ohio), Mark Heald (Swarthmore),
Jim McTavish (Liverpool), Matthew Moelter (Puget Sound), Paul Nachman (New
Mexico State), Gigi Quartapelle (Milan), Carl A. Rotter (West Virginia), Daniel Schroeder 
(Weber State), Juri Silmberg (Ryerson Polytechnic), Walther N. Spjeldvik (Weber State), 
Larry Tankersley (Naval Academy), and Dudley Towne (Amherst). Practically everything I 
know about electrodynamics — certainly about teaching electrodynamics — I owe to Edward 
Purcell. 



David J. Griffiths 
What is electrodynamics, and how does it fit into the
general scheme of physics? 



Four Realms of Mechanics 

In the diagram below I have sketched out the four great realms of mechanics: 



Classical Mechanics 

(Newton) 


Quantum Mechanics 

(Bohr, Heisenberg, 
Schrödinger, et al.)


Special Relativity 

(Einstein) 


Quantum Field Theory 

(Dirac, Pauli, Feynman, 
Schwinger, et al.) 



Newtonian mechanics was found to be inadequate in the early years of this century — it’s 
all right in “everyday life,” but for objects moving at high speeds (near the speed of light) 
it is incorrect, and must be replaced by special relativity (introduced by Einstein in 1905); 
for objects that are extremely small (near the size of atoms) it fails for different reasons, 
and is superseded by quantum mechanics (developed by Bohr, Schrödinger, Heisenberg,
and many others, in the twenties, mostly). For objects that are both very fast and very
small (as is common in modern particle physics), a mechanics that combines relativity and
quantum principles is in order: this relativistic quantum mechanics is known as quantum 
field theory — it was worked out in the thirties and forties, but even today it cannot claim 
to be a completely satisfactory system. In this book, save for the last chapter, we shall 
work exclusively in the domain of classical mechanics, although electrodynamics extends 
with unique simplicity to the other three realms. (In fact, the theory is in most respects 
automatically consistent with special relativity, for which it was, historically, the main 
stimulus.) 







ADVERTISEMENT 




Four Kinds of Forces 



Mechanics tells us how a system will behave when subjected to a given force. There are 
just four basic forces known (presently) to physics: I list them in the order of decreasing 
strength: 



1. Strong 

2. Electromagnetic 

3. Weak 

4. Gravitational 



The brevity of this list may surprise you. Where is friction? Where is the “normal” force 
that keeps you from falling through the floor? Where are the chemical forces that bind 
molecules together? Where is the force of impact between two colliding billiard balls? The 
answer is that all these forces are electromagnetic. Indeed, it is scarcely an exaggeration 
to say that we live in an electromagnetic world — for virtually every force we experience in 
everyday life, with the exception of gravity, is electromagnetic in origin. 

The strong forces, which hold protons and neutrons together in the atomic nucleus, 
have extremely short range, so we do not “feel” them, in spite of the fact that they are a 
hundred times more powerful than electrical forces. The weak forces, which account for 
certain kinds of radioactive decay, are not only of short range; they are far weaker than 
electromagnetic ones to begin with. As for gravity, it is so pitifully feeble (compared to all 
of the others) that it is only by virtue of huge mass concentrations (like the earth and the sun) 
that we ever notice it at all. The electrical repulsion between two electrons is 10^42 times
as large as their gravitational attraction, and if atoms were held together by gravitational 
(instead of electrical) forces, a single hydrogen atom would be much larger than the known 
universe. 
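The size of that ratio can be checked with a short calculation. The following is only a sketch: the function name is hypothetical, and the numerical constants are approximate reference values supplied here as assumptions, not taken from the text.

```python
# Ratio of the electric repulsion to the gravitational attraction between two
# electrons. Both forces fall off as 1/r^2, so the separation cancels.
# The constants below are approximate reference values (assumptions).
COULOMB_K = 8.9875517923e9          # 1/(4*pi*eps0), in N m^2 / C^2
NEWTON_G = 6.67430e-11              # gravitational constant, in N m^2 / kg^2
ELECTRON_CHARGE = 1.602176634e-19   # in C
ELECTRON_MASS = 9.1093837015e-31    # in kg


def electric_to_gravity_ratio(charge, mass):
    """Return (k q^2 / r^2) / (G m^2 / r^2) for two identical particles."""
    return COULOMB_K * charge**2 / (NEWTON_G * mass**2)


ratio = electric_to_gravity_ratio(ELECTRON_CHARGE, ELECTRON_MASS)
print(f"electric / gravitational = {ratio:.1e}")
```

With these inputs the ratio comes out at a few times 10^42, consistent with the order of magnitude quoted above.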

Not only are electromagnetic forces overwhelmingly the dominant ones in everyday 
life, they are also, at present, the only ones that are completely understood. There is, of 
course, a classical theory of gravity (Newton’s law of universal gravitation) and a relativistic 
one (Einstein’s general relativity), but no entirely satisfactory quantum mechanical theory 
of gravity has been constructed (though many people are working on it). At the present 
time there is a very successful (if cumbersome) theory for the weak interactions, and a 
strikingly attractive candidate (called chromodynamics) for the strong interactions. All 
these theories draw their inspiration from electrodynamics; none can claim conclusive 
experimental verification at this stage. So electrodynamics, a beautifully complete and 
successful theory, has become a kind of paradigm for physicists: an ideal model that other 
theories strive to emulate. 

The laws of classical electrodynamics were discovered in bits and pieces by Franklin, 
Coulomb, Ampère, Faraday, and others, but the person who completed the job, and packaged
it all in the compact and consistent form it has today, was James Clerk Maxwell. The theory 
is now a little over a hundred years old. 
The Unification of Physical Theories

In the beginning, electricity and magnetism were entirely separate subjects. The one dealt 
with glass rods and cat’s fur, pith balls, batteries, currents, electrolysis, and lightning; the 
other with bar magnets, iron filings, compass needles, and the North Pole. But in 1820
Oersted noticed that an electric current could deflect a magnetic compass needle. Soon
afterward, Ampère correctly postulated that all magnetic phenomena are due to electric
charges in motion. Then, in 1831, Faraday discovered that a moving magnet generates an 
electric current. By the time Maxwell and Lorentz put the finishing touches on the theory, 
electricity and magnetism were inextricably intertwined. They could no longer be regarded 
as separate subjects, but rather as two aspects of a single subject: electromagnetism. 

Faraday had speculated that light, too, is electrical in nature. Maxwell’s theory provided 
spectacular justification for this hypothesis, and soon optics — the study of lenses, mirrors, 
prisms, interference, and diffraction — was incorporated into electromagnetism. Hertz, who 
presented the decisive experimental confirmation for Maxwell’s theory in 1888, put it this 
way: “The connection between light and electricity is now established ... In every flame, 
in every luminous particle, we see an electrical process . . . Thus, the domain of electricity 
extends over the whole of nature. It even affects ourselves intimately: we perceive that we 
possess ... an electrical organ — the eye.” By 1900, then, three great branches of physics, 
electricity, magnetism, and optics, had merged into a single unified theory. (And it was 
soon apparent that visible light represents only a tiny “window” in the vast spectrum of 
electromagnetic radiation, from radio through microwaves, infrared and ultraviolet, to
x-rays and gamma rays.)

Einstein dreamed of a further unification, which would combine gravity and electrodynamics,
in much the same way as electricity and magnetism had been combined a century
earlier. His unified field theory was not particularly successful, but in recent years the same 
impulse has spawned a hierarchy of increasingly ambitious (and speculative) unification 
schemes, beginning in the 1960s with the electroweak theory of Glashow, Weinberg, and 
Salam (which joins the weak and electromagnetic forces), and culminating in the 1980s with
the superstring theory (which, according to its proponents, incorporates all four forces in a 
single “theory of everything”). At each step in this hierarchy the mathematical difficulties 
mount, and the gap between inspired conjecture and experimental test widens; nevertheless, 
it is clear that the unification of forces initiated by electrodynamics has become a major 
theme in the progress of physics. 

The Field Formulation of Electrodynamics 

The fundamental problem a theory of electromagnetism hopes to solve is this: I hold up 
a bunch of electric charges here (and maybe shake them around) — what happens to some 
other charge, over there? The classical solution takes the form of a field theory: We say 
that the space around an electric charge is permeated by electric and magnetic fields (the 
electromagnetic “odor,” as it were, of the charge). A second charge, in the presence of these 
fields, experiences a force; the fields, then, transmit the influence from one charge to the 
other — they mediate the interaction. 
When a charge undergoes acceleration, a portion of the field “detaches” itself, in a
sense, and travels off at the speed of light, carrying with it energy, momentum, and angular
momentum. We call this electromagnetic radiation. Its existence invites (if not compels)
us to regard the fields as independent dynamical entities in their own right, every bit as 
“real” as atoms or baseballs. Our interest accordingly shifts from the study of forces 
between charges to the theory of the fields themselves. But it takes a charge to produce an 
electromagnetic field, and it takes another charge to detect one, so we had best begin by 
reviewing the essential properties of electric charge. 

Electric Charge 

1. Charge comes in two varieties, which we call “plus” and “minus,” because their effects 
tend to cancel (if you have +q and -q at the same point, electrically it is the same as having 
no charge there at all). This may seem too obvious to warrant comment, but I encourage you 
to contemplate other possibilities: what if there were 8 or 10 different species of charge? 
(In chromodynamics there are, in fact, three quantities analogous to electric charge, each 
of which may be positive or negative.) Or what if the two kinds did not tend to cancel? 
The extraordinary fact is that plus and minus charges occur in exactly equal amounts, to 
fantastic precision, in bulk matter, so that their effects are almost completely neutralized. 
Were it not for this, we would be subjected to enormous forces: a potato would explode 
violently if the cancellation were imperfect by as little as one part in 10^10.

2. Charge is conserved: it cannot be created or destroyed — what there is now has always 
been. (A plus charge can “annihilate” an equal minus charge, but a plus charge cannot simply 
disappear by itself — something must account for that electric charge.) So the total charge of 
the universe is fixed for all time. This is called global conservation of charge. Actually, I can 
say something much stronger: Global conservation would allow for a charge to disappear 
in New York and instantly reappear in San Francisco (that wouldn’t affect the total), and yet 
we know this doesn’t happen. If the charge was in New York and it went to San Francisco, 
then it must have passed along some continuous path from one to the other. This is called 
local conservation of charge. Later on we’ll see how to formulate a precise mathematical 
law expressing local conservation of charge — it’s called the continuity equation. 

3. Charge is quantized. Although nothing in classical electrodynamics requires that it be
so, the fact is that electric charge comes only in discrete lumps: integer multiples of the
basic unit of charge. If we call the charge on the proton +e, then the electron carries charge
-e, the neutron charge zero, the pi mesons +e, 0, and -e, the carbon nucleus +6e, and
so on (never 7.392e, or even 1/2 e).¹ This fundamental unit of charge is extremely small,
so for practical purposes it is usually appropriate to ignore quantization altogether. Water,
too, “really” consists of discrete lumps (molecules); yet, if we are dealing with reasonably
large quantities of it we can treat it as a continuous fluid. This is in fact much closer
to Maxwell’s own view; he knew nothing of electrons and protons, and he must have pictured
charge as a kind of “jelly” that could be divided up into portions of any size and smeared
out at will.

¹Actually, protons and neutrons are composed of three quarks, which carry fractional charges (±2/3 e and ±1/3 e).
However, free quarks do not appear to exist in nature, and in any event this does not alter the fact that charge is
quantized; it merely reduces the size of the basic unit.

These, then, are the basic properties of charge. Before we discuss the forces between 
charges, some mathematical tools are necessary; their introduction will occupy us in
Chapter 1.



Units 



The subject of electrodynamics is plagued by competing systems of units, which sometimes 
render it difficult for physicists to communicate with one another. The problem is far worse 
than in mechanics, where Neanderthals still speak of pounds and feet; for in mechanics 
at least all equations look the same, regardless of the units used to measure quantities. 
Newton’s second law remains F = ma, whether it is feet-pounds-seconds, kilograms- 
meters-seconds, or whatever. But this is not so in electromagnetism, where Coulomb’s law 
may appear variously as 



F = (q_1 q_2 / 𝓇^2) 𝓻̂ (Gaussian), or

F = (1/(4πε_0)) (q_1 q_2 / 𝓇^2) 𝓻̂ (SI), or

F = (1/(4π)) (q_1 q_2 / 𝓇^2) 𝓻̂ (HL).



Of the systems in common use, the two most popular are Gaussian (cgs) and SI (mks).
Elementary particle theorists favor yet a third system: Heaviside-Lorentz. Although Gaussian
units offer distinct theoretical advantages, most undergraduate instructors seem to prefer
SI, I suppose because they incorporate the familiar household units (volts, amperes, and
watts). In this book, therefore, I have used SI units. Appendix C provides a “dictionary”
for converting the main results into Gaussian units.




Chapter 1 

Vector Analysis 



1.1 Vector Algebra 

1.1.1 Vector Operations 

If you walk 4 miles due north and then 3 miles due east (Fig. 1.1), you will have gone a 
total of 7 miles, but you’re not 7 miles from where you set out — you’re only 5. We need an 
arithmetic to describe quantities like this, which evidently do not add in the ordinary way. 
The reason they don’t, of course, is that displacements (straight line segments going from 
one point to another) have direction as well as magnitude (length), and it is essential to 
take both into account when you combine them. Such objects are called vectors: velocity, 
acceleration, force and momentum are other examples. By contrast, quantities that have 
magnitude but no direction are called scalars: examples include mass, charge, density, 
and temperature. I shall use boldface (A, B, and so on) for vectors and ordinary type 
for scalars. The magnitude of a vector A is written |A| or, more simply, A. In diagrams,
vectors are denoted by arrows: the length of the arrow is proportional to the magnitude of
the vector, and the arrowhead indicates its direction. Minus A (-A) is a vector with the
same magnitude as A but of opposite direction (Fig. 1.2). Note that vectors have magnitude
and direction but not location: a displacement of 4 miles due north from Washington is
represented by the same vector as a displacement 4 miles north from Baltimore (neglecting,
of course, the curvature of the earth). On a diagram, therefore, you can slide the arrow
around at will, as long as you don’t change its length or direction.

We define four vector operations: addition and three kinds of multiplication. 

(i) Addition of two vectors. Place the tail of B at the head of A; the sum, A + B, is 
the vector from the tail of A to the head of B (Fig. 1.3). (This rule generalizes the obvious
procedure for combining two displacements.) Addition is commutative: 

A + B = B + A; 

3 miles east followed by 4 miles north gets you to the same place as 4 miles north followed 
by 3 miles east. Addition is also associative: 

(A + B) + C = A + (B + C). 

To subtract a vector (Fig. 1.4), add its opposite:



A - B = A + (-B). 
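As a small illustration of these rules, the walk described at the start of the section can be reproduced numerically. This sketch anticipates the component form of Sect. 1.1.2 by storing each displacement as an (east, north) pair; the helper names are hypothetical.

```python
import math


def add(a, b):
    """Add two vectors component by component."""
    return tuple(p + q for p, q in zip(a, b))


def scale(s, a):
    """Multiply a vector by the scalar s."""
    return tuple(s * p for p in a)


def subtract(a, b):
    """A - B, computed as A + (-B)."""
    return add(a, scale(-1.0, b))


def magnitude(a):
    return math.sqrt(sum(p * p for p in a))


north = (0.0, 4.0)   # 4 miles due north, stored as (east, north)
east = (3.0, 0.0)    # 3 miles due east
trip = add(north, east)

print("net displacement:", trip, "distance:", magnitude(trip))
print("commutative:", add(east, north) == trip)
```

The distance walked is 7 miles, but the magnitude of the net displacement is 5 miles, and the order of the two legs does not matter.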




Figure 1.3 




(ii) Multiplication by a scalar. Multiplication of a vector by a positive scalar a multiplies
the magnitude but leaves the direction unchanged (Fig. 1.5). (If a is negative, the
direction is reversed.) Scalar multiplication is distributive:

a(A + B) = aA + aB.

(iii) Dot product of two vectors. The dot product of two vectors is defined by 

A • B = AB cos θ, (1.1)

where θ is the angle they form when placed tail-to-tail (Fig. 1.6). Note that A • B is itself a
scalar (hence the alternative name scalar product). The dot product is commutative,

A • B = B • A,

Figure 1.5




and distributive, 



A • (B + C) = A • B + A • C. (1.2)

Geometrically, A • B is the product of A times the projection of B along A (or the product
of B times the projection of A along B). If the two vectors are parallel, then A • B = AB.
In particular, for any vector A,

A • A = A^2. (1.3)

If A and B are perpendicular, then A • B = 0.



Example 1.1 



Let C = A - B (Fig. 1.7), and calculate the dot product of C with itself.

Solution:

C • C = (A - B) • (A - B) = A • A - A • B - B • A + B • B,
or

C^2 = A^2 + B^2 - 2AB cos θ.

This is the law of cosines. 
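A quick numerical check of this result, offered as a sketch with arbitrarily chosen vectors: the left side is computed from components, while the angle on the right side is obtained independently from the polar angles of A and B in the plane. The helper names are hypothetical.

```python
import math


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def magnitude(a):
    return math.sqrt(dot(a, a))


A = (3.0, 0.0)
B = (1.0, 2.0)
C = tuple(p - q for p, q in zip(A, B))   # C = A - B

# Angle between A and B, taken from their polar angles rather than from
# the dot product, so the comparison below is not circular.
theta = math.atan2(B[1], B[0]) - math.atan2(A[1], A[0])

lhs = dot(C, C)
rhs = (magnitude(A) ** 2 + magnitude(B) ** 2
       - 2 * magnitude(A) * magnitude(B) * math.cos(theta))

print(lhs, rhs)
```

Both sides agree to rounding error, as the law of cosines requires.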



(iv) Cross product of two vectors. The cross product of two vectors is defined by 

A × B = AB sin θ n̂, (1.4)

where n̂ is a unit vector (vector of length 1) pointing perpendicular to the plane of A and
B. (I shall use a hat (^) to designate unit vectors.) Of course, there are two directions
perpendicular to any plane: “in” and “out.” The ambiguity is resolved by the right-hand
rule: let your fingers point in the direction of the first vector and curl around (via the smaller
angle) toward the second; then your thumb indicates the direction of n̂. (In Fig. 1.8, A × B
points into the page; B × A points out of the page.) Note that A × B is itself a vector (hence
the alternative name vector product). The cross product is distributive,



A × (B + C) = (A × B) + (A × C), (1.5)

Figure 1.8



but not commutative. In fact, 

(B × A) = -(A × B). (1.6)

Geometrically, |A x B| is the area of the parallelogram generated by A and B (Fig. 1.8). If 
two vectors are parallel, their cross product is zero. In particular, 

A x A = 0 

for any vector A. 



Problem 1.1 Using the definitions in Eqs. 1.1 and 1.4, and appropriate diagrams, show that 
the dot product and cross product are distributive, 

a) when the three vectors are coplanar; 

b) in the general case. 

Problem 1.2 Is the cross product associative? 

(A x B) x C = A x (B x C). 

If so, prove it; if not, provide a counterexample. 



1.1.2 Vector Algebra: Component Form 

In the previous section I defined the four vector operations (addition, scalar multiplication, 
dot product, and cross product) in “abstract” form, that is, without reference to any particular
coordinate system. In practice, it is often easier to set up Cartesian coordinates x, y, z
and work with vector “components.” Let x̂, ŷ, and ẑ be unit vectors parallel to the x, y, and
z axes, respectively (Fig. 1.9(a)). An arbitrary vector A can be expanded in terms of these
basis vectors (Fig. 1.9(b)):

A = A_x x̂ + A_y ŷ + A_z ẑ.

Figure 1.9



The numbers A_x, A_y, and A_z are called components of A; geometrically, they are the
projections of A along the three coordinate axes. We can now reformulate each of the four
vector operations as a rule for manipulating components:

A + B = (A_x x̂ + A_y ŷ + A_z ẑ) + (B_x x̂ + B_y ŷ + B_z ẑ)

= (A_x + B_x) x̂ + (A_y + B_y) ŷ + (A_z + B_z) ẑ. (1.7)

(i) Rule: To add vectors, add like components.

aA = (aA_x) x̂ + (aA_y) ŷ + (aA_z) ẑ. (1.8)

(ii) Rule: To multiply by a scalar, multiply each component.

Because x̂, ŷ, and ẑ are mutually perpendicular unit vectors,

x̂ • x̂ = ŷ • ŷ = ẑ • ẑ = 1; x̂ • ŷ = x̂ • ẑ = ŷ • ẑ = 0. (1.9)

Accordingly,

A • B = (A_x x̂ + A_y ŷ + A_z ẑ) • (B_x x̂ + B_y ŷ + B_z ẑ)

= A_x B_x + A_y B_y + A_z B_z. (1.10)

(iii) Rule: To calculate the dot product, multiply like components, and add.

In particular,

A • A = A_x^2 + A_y^2 + A_z^2,
so

A = √(A_x^2 + A_y^2 + A_z^2). (1.11)

(This is, if you like, the three-dimensional generalization of the Pythagorean theorem.) Note
that the dot product of A with any unit vector is the component of A along that direction
(thus A • x̂ = A_x, A • ŷ = A_y, and A • ẑ = A_z).
Similarly,¹

x̂ × x̂ = ŷ × ŷ = ẑ × ẑ = 0,
x̂ × ŷ = -ŷ × x̂ = ẑ,
ŷ × ẑ = -ẑ × ŷ = x̂,
ẑ × x̂ = -x̂ × ẑ = ŷ. (1.12)

Therefore,

A × B = (A_x x̂ + A_y ŷ + A_z ẑ) × (B_x x̂ + B_y ŷ + B_z ẑ) (1.13)

= (A_y B_z - A_z B_y) x̂ + (A_z B_x - A_x B_z) ŷ + (A_x B_y - A_y B_x) ẑ.

This cumbersome expression can be written more neatly as a determinant:

        | x̂    ŷ    ẑ   |
A × B = | A_x  A_y  A_z |    (1.14)
        | B_x  B_y  B_z |



(iv) Rule: To calculate the cross product, form the determinant whose first row is x̂, ŷ, ẑ,
whose second row is A (in component form), and whose third row is B. 
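The determinant rule translates directly into code. The sketch below expands Eq. 1.14 along its first row and checks a few properties stated earlier (anticommutativity, A × A = 0, and perpendicularity to both factors); the function names and sample vectors are illustrative assumptions.

```python
def cross(a, b):
    """Cross product from the expansion of the determinant in Eq. 1.14."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by,   # x component
            az * bx - ax * bz,   # y component
            ax * by - ay * bx)   # z component


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


A = (1.0, 2.0, 3.0)
B = (4.0, 5.0, 6.0)

AxB = cross(A, B)
BxA = cross(B, A)

print("A x B =", AxB)
print("B x A =", BxA)                      # opposite sign, Eq. 1.6
print("A x A =", cross(A, A))              # zero vector
print("A . (A x B) =", dot(A, AxB))        # zero: A x B is perpendicular to A
print("B . (A x B) =", dot(B, AxB))        # zero: A x B is perpendicular to B
```

Writing out the three components explicitly, rather than calling a general determinant routine, keeps the correspondence with Eq. 1.13 visible.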

Example 1.2 



Find the angle between the face diagonals of a cube. 

Solution: We might as well use a cube of side 1, and place it as shown in Fig. 1.10, with one 
corner at the origin. The face diagonals A and B are 

A = 1 x̂ + 0 ŷ + 1 ẑ; B = 0 x̂ + 1 ŷ + 1 ẑ.




Figure 1.10 



¹These signs pertain to a right-handed coordinate system (x-axis out of the page, y-axis to the right, z-axis up,
or any rotated version thereof). In a left-handed system (z-axis down) the signs are reversed: x̂ × ŷ = -ẑ, and so
on. We shall use right-handed systems exclusively.

So, in component form,

A • B = 1 · 0 + 0 · 1 + 1 · 1 = 1.

On the other hand, in “abstract” form,

A • B = AB cos θ = √2 √2 cos θ = 2 cos θ.

Therefore,

cos θ = 1/2, or θ = 60°.



Of course, you can get the answer more easily by drawing in a diagonal across the top of the 
cube, completing the equilateral triangle. But in cases where the geometry is not so simple, 
this device of comparing the abstract and component forms of the dot product can be a very
efficient means of finding angles. 
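The same comparison of the two forms of the dot product is easy to automate. In this sketch the function name angle_between is hypothetical; it solves A • B = AB cos θ for θ, using the component rule for the dot product.

```python
import math


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def magnitude(a):
    return math.sqrt(dot(a, a))


def angle_between(a, b):
    """Angle in degrees between two nonzero vectors, from A.B = AB cos(theta)."""
    cos_theta = dot(a, b) / (magnitude(a) * magnitude(b))
    # Clamp to [-1, 1] to guard against rounding error before taking acos.
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


A = (1.0, 0.0, 1.0)   # face diagonal from Example 1.2
B = (0.0, 1.0, 1.0)   # face diagonal from Example 1.2

theta_deg = angle_between(A, B)
print(f"angle between face diagonals: {theta_deg:.6f} degrees")
```

For the face diagonals of Example 1.2 this reproduces the 60° found above, up to rounding.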



Problem 1.3 Find the angle between the body diagonals of a cube. 

Problem 1.4 Use the cross product to find the components of the unit vector n̂ perpendicular
to the plane shown in Fig. 1.11. 



1.1.3 Triple Products 

Since the cross product of two vectors is itself a vector, it can be dotted or crossed with a 
third vector to form a triple product. 

(i) Scalar triple product: A • (B x C). Geometrically, |A • (B x C)| is the volume 
of the parallelepiped generated by A, B, and C, since |B x C| is the area of the base, and 
A cos θ is the altitude (Fig. 1.12). Evidently,

A • (B × C) = B • (C × A) = C • (A × B), (1.15)

for they all correspond to the same figure. Note that “alphabetical” order is preserved — in 
view of Eq. 1.6, the “nonalphabetical” triple products, 

A • (C x B) = B • (A x C) = C • (B x A), 





Figure 1.11 



Figure 1.12 
have the opposite sign. In component form,

              | A_x  A_y  A_z |
A • (B × C) = | B_x  B_y  B_z |.    (1.16)
              | C_x  C_y  C_z |

Note that the dot and cross can be interchanged: 

A • (B x C) = (A x B) • C 

(this follows immediately from Eq. 1.15); however, the placement of the parentheses is 
critical: (A • B) x C is a meaningless expression — you can’t make a cross product from a 
scalar and a vector. 
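These properties of the scalar triple product can be confirmed numerically. The following sketch uses integer components, chosen arbitrarily, so that every comparison is exact; the helper names are hypothetical.

```python
def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def cross(a, b):
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)


def scalar_triple(a, b, c):
    """A . (B x C): the signed volume of the parallelepiped on A, B, C."""
    return dot(a, cross(b, c))


A = (1, 2, 3)
B = (0, 1, 4)
C = (5, 6, 0)

cyclic = [scalar_triple(A, B, C), scalar_triple(B, C, A), scalar_triple(C, A, B)]
swapped = scalar_triple(A, C, B)        # "nonalphabetical" order
dot_cross_exchanged = dot(cross(A, B), C)

print("cyclic orders:", cyclic)
print("nonalphabetical order:", swapped)
print("(A x B) . C =", dot_cross_exchanged)
```

The three cyclic orderings agree, the nonalphabetical ordering flips the sign, and exchanging the dot and the cross leaves the value unchanged.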

(ii) Vector triple product: A x (B x C). The vector triple product can be simplified 
by the so-called BAC-CAB rule: 

A × (B × C) = B(A • C) - C(A • B). (1.17)

Notice that 

(A x B) x C = -C x (A x B) = -A(B • C) + B(A • C) 

is an entirely different vector. Incidentally, all higher vector products can be similarly 
reduced, often by repeated application of Eq. 1.17, so it is never necessary for an expression
to contain more than one cross product in any term. For instance,

(A × B) • (C × D) = (A • C)(B • D) - (A • D)(B • C);

A × (B × (C × D)) = B(A • (C × D)) - (A • B)(C × D). (1.18)
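A numerical spot check of the BAC-CAB rule is sketched below. It is not a proof (that is the subject of Problem 1.5), only a test on one arbitrary set of integer vectors, and the helper names are hypothetical.

```python
def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def cross(a, b):
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)


def scale(s, a):
    return tuple(s * p for p in a)


def subtract(a, b):
    return tuple(p - q for p, q in zip(a, b))


A = (1, 2, 3)
B = (0, 1, 4)
C = (5, 6, 0)

# Left and right sides of Eq. 1.17.
lhs = cross(A, cross(B, C))
rhs = subtract(scale(dot(A, C), B), scale(dot(A, B), C))

# The other placement of parentheses gives a different vector.
other = cross(cross(A, B), C)

print("A x (B x C)       =", lhs)
print("B(A.C) - C(A.B)   =", rhs)
print("(A x B) x C       =", other)
```

For these vectors the two sides of Eq. 1.17 coincide, while (A × B) × C is visibly a different vector.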



Problem 1.5 Prove the BAC-CAB rule by writing out both sides in component form. 
Problem 1.6 Prove that 

[A x (B x C)] + [B x (C x A)] + [C x (A x B)] = 0. 

Under what conditions does A x (B x C) = (A x B) x C? 



1.1.4 Position, Displacement, and Separation Vectors 

The location of a point in three dimensions can be described by listing its Cartesian
coordinates (x, y, z). The vector to that point from the origin (Fig. 1.13) is called the position
vector: 



r = x x̂ + y ŷ + z ẑ. (1.19)

I will reserve the letter r for this purpose, throughout the book. Its magnitude,

r = √(x^2 + y^2 + z^2), (1.20)

is the distance from the origin, and

r̂ = r/|r| = (x x̂ + y ŷ + z ẑ) / √(x^2 + y^2 + z^2) (1.21)

is a unit vector pointing radially outward. The infinitesimal displacement vector, from
(x, y, z) to (x + dx, y + dy, z + dz), is

dl = dx x̂ + dy ŷ + dz ẑ. (1.22)

(We could call this dr, since that’s what it is, but it is useful to reserve a special letter for
infinitesimal displacements.)

In electrodynamics one frequently encounters problems involving two points: typically,
a source point, r', where an electric charge is located, and a field point, r, at which you
are calculating the electric or magnetic field (Fig. 1.14). It pays to adopt right from the start
some short-hand notation for the separation vector from the source point to the field point.
I shall use for this purpose the script letter 𝓻:

𝓻 = r - r'. (1.23)

Its magnitude is

𝓇 = |r - r'|, (1.24)

and a unit vector in the direction from r' to r is

𝓻̂ = 𝓻/𝓇 = (r - r') / |r - r'|. (1.25)
In Cartesian coordinates,

𝓻 = (x - x') x̂ + (y - y') ŷ + (z - z') ẑ, (1.26)

𝓇 = √((x - x')^2 + (y - y')^2 + (z - z')^2), (1.27)

𝓻̂ = [(x - x') x̂ + (y - y') ŷ + (z - z') ẑ] / √((x - x')^2 + (y - y')^2 + (z - z')^2) (1.28)

(from which you can begin to appreciate the advantage of the script-𝓻 notation).
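The three Cartesian formulas above amount to a subtraction, a square root, and a division. The sketch below shows them for one arbitrarily chosen pair of points (not the pair used in the problem that follows); the function names are hypothetical.

```python
import math


def separation(field_point, source_point):
    """Separation vector from the source point r' to the field point r."""
    return tuple(r - rp for r, rp in zip(field_point, source_point))


def magnitude(v):
    return math.sqrt(sum(c * c for c in v))


def unit(v):
    """Unit vector along v; undefined when the two points coincide."""
    m = magnitude(v)
    if m == 0.0:
        raise ValueError("source and field points coincide")
    return tuple(c / m for c in v)


source = (1.0, 2.0, 3.0)   # r', where the charge sits
field = (4.0, 6.0, 3.0)    # r, where the field is evaluated

sep = separation(field, source)
print("separation vector:", sep)
print("magnitude:", magnitude(sep))
print("unit vector:", unit(sep))
```

Note the order of the subtraction: the separation vector points from the source to the field point, so swapping the arguments reverses its direction.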



Problem 1.7 Find the separation vector 𝓻 from the source point (2,8,7) to the field point (4,6,8).
Determine its magnitude (𝓇), and construct the unit vector 𝓻̂.



1.1.5 How Vectors Transform 

The definition of a vector as “a quantity with a magnitude and direction” is not altogether 
satisfactory: What precisely does “direction” mean?² This may seem a pedantic question,
but we shall shortly encounter a species of derivative that looks rather like a vector, and 
we’ll want to know for sure whether it is one. You might be inclined to say that a vector 
is anything that has three components that combine properly under addition. Well, how 
about this: We have a barrel of fruit that contains N_x pears, N_y apples, and N_z bananas.
Is N = N_x x̂ + N_y ŷ + N_z ẑ a vector? It has three components, and when you add another
barrel with M_x pears, M_y apples, and M_z bananas the result is (N_x + M_x) pears, (N_y + M_y)
apples, (N_z + M_z) bananas. So it does add like a vector. Yet it’s obviously not a vector, in
the physicist’s sense of the word, because it doesn’t really have a direction. What exactly 
is wrong with it? 

The answer is that $\mathbf{N}$ does not transform properly when you change coordinates. The
coordinate frame we use to describe positions in space is of course entirely arbitrary, but
there is a specific geometrical transformation law for converting vector components from
one frame to another. Suppose, for instance, the $\bar{x}, \bar{y}, \bar{z}$ system is rotated by angle $\phi$, relative
to $x, y, z$, about the common $x = \bar{x}$ axes. From Fig. 1.15,

$$A_y = A\cos\theta, \qquad A_z = A\sin\theta,$$

while

$$\bar{A}_y = A\cos\bar{\theta} = A\cos(\theta - \phi) = A(\cos\theta\cos\phi + \sin\theta\sin\phi) = \cos\phi\,A_y + \sin\phi\,A_z,$$

$$\bar{A}_z = A\sin\bar{\theta} = A\sin(\theta - \phi) = A(\sin\theta\cos\phi - \cos\theta\sin\phi) = -\sin\phi\,A_y + \cos\phi\,A_z.$$



² This section can be skipped without loss of continuity.

Figure 1.15



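We might express this conclusion in matrix notation:

$$\begin{pmatrix} \bar{A}_y \\ \bar{A}_z \end{pmatrix} = \begin{pmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{pmatrix} \begin{pmatrix} A_y \\ A_z \end{pmatrix}. \tag{1.29}$$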

More generally, for rotation about an arbitrary axis in three dimensions, the transformation
law takes the form

$$\begin{pmatrix} \bar{A}_x \\ \bar{A}_y \\ \bar{A}_z \end{pmatrix} = \begin{pmatrix} R_{xx} & R_{xy} & R_{xz} \\ R_{yx} & R_{yy} & R_{yz} \\ R_{zx} & R_{zy} & R_{zz} \end{pmatrix} \begin{pmatrix} A_x \\ A_y \\ A_z \end{pmatrix}, \tag{1.30}$$

or, more compactly,

$$\bar{A}_i = \sum_{j=1}^{3} R_{ij} A_j, \tag{1.31}$$

where the index 1 stands for x, 2 for y, and 3 for z. The elements of the matrix R can be
ascertained, for a given rotation, by the same sort of geometrical arguments as we used for
a rotation about the x axis.
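The transformation law can be tried out numerically. The following Python sketch is an illustration under stated assumptions, not part of the original text: the helper names, the sample vector, and the angle are all hypothetical. It applies the rotation about the x axis directly and again through the general sum over $R_{ij}A_j$, so the two routes can be compared.

```python
import math

def rotate_about_x(a, phi):
    """Components of A in axes rotated by angle phi about the common x axis."""
    ax, ay, az = a
    c, s = math.cos(phi), math.sin(phi)
    return (ax, c * ay + s * az, -s * ay + c * az)

def rotation_matrix_x(phi):
    """3x3 matrix R for the same rotation, padded with the unchanged x row."""
    c, s = math.cos(phi), math.sin(phi)
    return [[1.0, 0.0, 0.0],
            [0.0, c, s],
            [0.0, -s, c]]

def apply_matrix(R, a):
    """General law: A_bar_i = sum over j of R_ij * A_j."""
    return tuple(sum(R[i][j] * a[j] for j in range(3)) for i in range(3))

def length(a):
    return math.sqrt(sum(c * c for c in a))

# Hypothetical sample data
a = (1.0, 2.0, 3.0)
phi = 0.7
a_bar = rotate_about_x(a, phi)
print(a_bar, apply_matrix(rotation_matrix_x(phi), a))
print(length(a), length(a_bar))  # the components change, the length does not
```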

Now: Do the components of $\mathbf{N}$ transform in this way? Of course not: it doesn’t matter
what coordinates you use to represent positions in space, there is still the same number of
apples in the barrel. You can’t convert a pear into a banana by choosing a different set of
axes, but you can turn $A_x$ into $A_y$. Formally, then, a vector is any set of three components
that transforms in the same manner as a displacement when you change coordinates. As
always, displacement is the model for the behavior of all vectors.

By the way, a (second-rank) tensor is a quantity with nine components, $T_{xx}$, $T_{xy}$, $T_{xz}$,
$T_{yx}$, ..., $T_{zz}$, which transforms with two factors of R:

$$\begin{aligned}
\bar{T}_{xx} ={}& R_{xx}(R_{xx}T_{xx} + R_{xy}T_{xy} + R_{xz}T_{xz}) \\
&+ R_{xy}(R_{xx}T_{yx} + R_{xy}T_{yy} + R_{xz}T_{yz}) \\
&+ R_{xz}(R_{xx}T_{zx} + R_{xy}T_{zy} + R_{xz}T_{zz}), \ \ldots
\end{aligned}$$

or, more compactly,

$$\bar{T}_{ij} = \sum_{k=1}^{3}\sum_{l=1}^{3} R_{ik}R_{jl}T_{kl}. \tag{1.32}$$

In general, an nth-rank tensor has n indices and $3^n$ components, and transforms with n
factors of R. In this hierarchy, a vector is a tensor of rank 1, and a scalar is a tensor of rank
zero.
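A short Python sketch of Eq. 1.32 follows. It rests on one assumption that the text does not make: it uses the array of products $A_iB_j$ of two vectors as a sample second-rank tensor. Transforming that array with two factors of R should then agree with first rotating A and B and multiplying afterward. All function names and numbers are hypothetical.

```python
import math

def rotation_matrix_x(phi):
    """Rotation about the x axis, in the form of Eq. 1.29 padded to 3x3."""
    c, s = math.cos(phi), math.sin(phi)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def transform_vector(R, a):
    """A_bar_i = sum_j R_ij A_j (one factor of R)."""
    return tuple(sum(R[i][j] * a[j] for j in range(3)) for i in range(3))

def transform_tensor(R, T):
    """T_bar_ij = sum_k sum_l R_ik R_jl T_kl (two factors of R)."""
    return [[sum(R[i][k] * R[j][l] * T[k][l] for k in range(3) for l in range(3))
             for j in range(3)] for i in range(3)]

def outer(a, b):
    """Nine products a_i * b_j, used here as a sample second-rank tensor."""
    return [[ai * bj for bj in b] for ai in a]

R = rotation_matrix_x(0.4)          # hypothetical angle
A = (1.0, 2.0, 3.0)                 # hypothetical vectors
B = (-1.0, 0.5, 2.0)
T_bar = transform_tensor(R, outer(A, B))
expected = outer(transform_vector(R, A), transform_vector(R, B))
print(T_bar[0], expected[0])
```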



Problem 1.8 

(a) Prove that the two-dimensional rotation matrix (1.29) preserves dot products. (That is,
show that $\bar{A}_y\bar{B}_y + \bar{A}_z\bar{B}_z = A_yB_y + A_zB_z$.)

(b) What constraints must the elements ($R_{ij}$) of the three-dimensional rotation matrix (1.30)
satisfy in order to preserve the length of $\mathbf{A}$ (for all vectors $\mathbf{A}$)?

Problem 1.9 Find the transformation matrix R that describes a rotation by 120° about an axis 
from the origin through the point (1, 1, 1). The rotation is clockwise as you look down the 
axis toward the origin. 

Problem 1.10 

(a) How do the components of a vector transform under a translation of coordinates ($\bar{x} = x$,
$\bar{y} = y - a$, $\bar{z} = z$, Fig. 1.16a)?

(b) How do the components of a vector transform under an inversion of coordinates ($\bar{x} = -x$,
$\bar{y} = -y$, $\bar{z} = -z$, Fig. 1.16b)?

(c) How does the cross product (1.13) of two vectors transform under inversion? [The cross-product
of two vectors is properly called a pseudovector because of this “anomalous” behavior.]
Is the cross product of two pseudovectors a vector, or a pseudovector? Name two
pseudovector quantities in classical mechanics.

(d) How does the scalar triple product of three vectors transform under inversions? (Such an 
object is called a pseudoscalar.) 





Figure 1.16 
1.2 Differential Calculus



1.2.1 “Ordinary” Derivatives 

Question: Suppose we have a function of one variable: f(x). What does the derivative,
df/dx, do for us? Answer: It tells us how rapidly the function f(x) varies when we change
the argument x by a tiny amount, dx:

$$df = \left(\frac{df}{dx}\right)dx. \tag{1.33}$$

In words: If we change x by an amount dx, then f changes by an amount df; the derivative
is the proportionality factor. For example, in Fig. 1.17(a), the function varies slowly with
x, and the derivative is correspondingly small. In Fig. 1.17(b), f increases rapidly with x,
and the derivative is large, as you move away from x = 0.

Geometrical Interpretation: The derivative df/dx is the slope of the graph of f versus x.




1.2.2 Gradient 



Suppose, now, that we have a function of three variables — say, the temperature T(x, y, z) 
in a room. (Start out in one corner, and set up a system of axes; then for each point (x, y, z)
in the room, T gives the temperature at that spot.) We want to generalize the notion of 
“derivative” to functions like T, which depend not on one but on three variables. 

Now a derivative is supposed to tell us how fast the function varies, if we move a little 
distance. But this time the situation is more complicated, because it depends on what 
direction we move: If we go straight up, then the temperature will probably increase fairly 
rapidly, but if we move horizontally, it may not change much at all. In fact, the question 
“How fast does T vary?” has an infinite number of answers, one for each direction we 
might choose to explore. 

Fortunately, the problem is not as bad as it looks. A theorem on partial derivatives states
that

$$dT = \left(\frac{\partial T}{\partial x}\right)dx + \left(\frac{\partial T}{\partial y}\right)dy + \left(\frac{\partial T}{\partial z}\right)dz. \tag{1.34}$$

This tells us how T changes when we alter all three variables by the infinitesimal amounts
dx, dy, dz. Notice that we do not require an infinite number of derivatives; three will
suffice: the partial derivatives along each of the three coordinate directions.

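Equation 1.34 is reminiscent of a dot product:

$$dT = \left(\frac{\partial T}{\partial x}\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\hat{\mathbf{z}}\right)\cdot\left(dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}\right) = (\nabla T)\cdot(d\mathbf{l}), \tag{1.35}$$

where

$$\nabla T \equiv \frac{\partial T}{\partial x}\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\hat{\mathbf{z}} \tag{1.36}$$

is the gradient of T. $\nabla T$ is a vector quantity, with three components; it is the generalized
derivative we have been looking for. Equation 1.35 is the three-dimensional version of
Eq. 1.33.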

Geometrical Interpretation of the Gradient: Like any vector, the gradient has magnitude
and direction. To determine its geometrical meaning, let’s rewrite the dot product (1.35) in
abstract form:

$$dT = \nabla T\cdot d\mathbf{l} = |\nabla T|\,|d\mathbf{l}|\cos\theta, \tag{1.37}$$

where $\theta$ is the angle between $\nabla T$ and $d\mathbf{l}$. Now, if we fix the magnitude $|d\mathbf{l}|$ and search
around in various directions (that is, vary $\theta$), the maximum change in T evidently occurs
when $\theta = 0$ (for then $\cos\theta = 1$). That is, for a fixed distance $|d\mathbf{l}|$, dT is greatest when I
move in the same direction as $\nabla T$. Thus:

The gradient $\nabla T$ points in the direction of maximum increase of the function
T.

Moreover:

The magnitude $|\nabla T|$ gives the slope (rate of increase) along this maximal
direction.



Imagine you are standing on a hillside. Look all around you, and find the direction 
of steepest ascent. That is the direction of the gradient. Now measure the slope in that 
direction (rise over run). That is the magnitude of the gradient. (Here the function we’re 
talking about is the height of the hill, and the coordinates it depends on are positions — 
latitude and longitude, say. This function depends on only two variables, not three, but the 
geometrical meaning of the gradient is easier to grasp in two dimensions.) Notice from
Eq. 1.37 that the direction of maximum descent is opposite to the direction of maximum
ascent, while at right angles ($\theta = 90°$) the slope is zero (the gradient is perpendicular to
the contour lines). You can conceive of surfaces that do not have these properties, but they
always have “kinks” in them and correspond to nondifferentiable functions.

What would it mean for the gradient to vanish? If $\nabla T = 0$ at (x, y, z), then dT = 0
for small displacements about the point (x, y, z). This is, then, a stationary point of the
function T(x, y, z). It could be a maximum (a summit), a minimum (a valley), a saddle
point (a pass), or a “shoulder.” This is analogous to the situation for functions of one
variable, where a vanishing derivative signals a maximum, a minimum, or an inflection. In
particular, if you want to locate the extrema of a function of three variables, set its gradient
equal to zero.

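Example 1.3

Find the gradient of $r = \sqrt{x^2 + y^2 + z^2}$ (the magnitude of the position vector).

Solution:

$$\begin{aligned}
\nabla r &= \frac{\partial r}{\partial x}\hat{\mathbf{x}} + \frac{\partial r}{\partial y}\hat{\mathbf{y}} + \frac{\partial r}{\partial z}\hat{\mathbf{z}} \\
&= \frac{1}{2}\frac{2x}{\sqrt{x^2 + y^2 + z^2}}\hat{\mathbf{x}} + \frac{1}{2}\frac{2y}{\sqrt{x^2 + y^2 + z^2}}\hat{\mathbf{y}} + \frac{1}{2}\frac{2z}{\sqrt{x^2 + y^2 + z^2}}\hat{\mathbf{z}} \\
&= \frac{x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}}{\sqrt{x^2 + y^2 + z^2}} = \frac{\mathbf{r}}{r} = \hat{\mathbf{r}}.
\end{aligned}$$

Does this make sense? Well, it says that the distance from the origin increases most rapidly in
the radial direction, and that its rate of increase in that direction is 1... just what you’d expect.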
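The result of Example 1.3 can be checked numerically. The sketch below is an illustration, not part of the original text: it estimates the gradient by central differences (the helper `grad`, the step size `h`, and the sample point are assumptions) and compares the slope along the gradient with the slope along another direction, as in Eq. 1.37.

```python
import math

def grad(f, p, h=1e-6):
    """Central-difference estimate of the gradient of f at the point p = (x, y, z)."""
    g = []
    for i in range(3):
        fwd, bwd = list(p), list(p)
        fwd[i] += h
        bwd[i] -= h
        g.append((f(*fwd) - f(*bwd)) / (2 * h))
    return tuple(g)

def r(x, y, z):
    """Distance from the origin."""
    return math.sqrt(x * x + y * y + z * z)

def slope_along(g, direction):
    """dT/dl for a step in the given direction: the gradient dotted with a unit vector."""
    norm = math.sqrt(sum(c * c for c in direction))
    return sum(gi * di / norm for gi, di in zip(g, direction))

p = (1.0, 2.0, 2.0)                 # hypothetical sample point, where r = 3
g = grad(r, p)                      # should be close to r_hat = (1/3, 2/3, 2/3)
steepest = slope_along(g, g)        # slope along the gradient itself, i.e. |grad r|
sideways = slope_along(g, (1.0, 0.0, 0.0))
print(g, steepest, sideways)
```

The slope along the gradient comes out close to 1, and the slope along any other direction is smaller by the factor $\cos\theta$.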



Problem 1.11 Find the gradients of the following functions: 

(a) $f(x, y, z) = x^2 + y^3 + z^4$.

(b) $f(x, y, z) = x^2 y^2 z^4$.

(c) $f(x, y, z) = e^x \sin(y) \ln(z)$.

Problem 1.12 The height of a certain hill (in feet) is given by 

$$h(x, y) = 10(2xy - 3x^2 - 4y^2 - 18x + 28y + 12),$$

where y is the distance (in miles) north, x the distance east of South Hadley. 

(a) Where is the top of the hill located? 

(b) How high is the hill? 

(c) How steep is the slope (in feet per mile) at a point 1 mile north and one mile east of South 
Hadley? In what direction is the slope steepest, at that point? 

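Problem 1.13 Let $\boldsymbol{\mathscr{r}}$ be the separation vector from a fixed point $(x', y', z')$ to the point $(x, y, z)$,
and let $\mathscr{r}$ be its length. Show that

(a) $\nabla(\mathscr{r}^2) = 2\boldsymbol{\mathscr{r}}$.

(b) $\nabla(1/\mathscr{r}) = -\hat{\boldsymbol{\mathscr{r}}}/\mathscr{r}^2$.

(c) What is the general formula for $\nabla(\mathscr{r}^n)$?

Problem 1.14 Suppose that f is a function of two variables (y and z) only. Show that the
gradient $\nabla f = (\partial f/\partial y)\hat{\mathbf{y}} + (\partial f/\partial z)\hat{\mathbf{z}}$ transforms as a vector under rotations, Eq. 1.29. [Hint:
$(\partial f/\partial\bar{y}) = (\partial f/\partial y)(\partial y/\partial\bar{y}) + (\partial f/\partial z)(\partial z/\partial\bar{y})$, and the analogous formula for $\partial f/\partial\bar{z}$. We
know that $\bar{y} = y\cos\phi + z\sin\phi$ and $\bar{z} = -y\sin\phi + z\cos\phi$; “solve” these equations for y and
z (as functions of $\bar{y}$ and $\bar{z}$), and compute the needed derivatives $\partial y/\partial\bar{y}$, $\partial z/\partial\bar{y}$, etc.]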



1.2.3 The Operator $\nabla$

The gradient has the formal appearance of a vector, $\nabla$, “multiplying” a scalar T:

$$\nabla T = \left(\hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}\right)T. \tag{1.38}$$

(For once I write the unit vectors to the left, just so no one will think this means $\partial\hat{\mathbf{x}}/\partial x$, and
so on, which would be zero, since $\hat{\mathbf{x}}$ is constant.) The term in parentheses is called “del”:

$$\nabla = \hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}. \tag{1.39}$$

Of course, del is not a vector, in the usual sense. Indeed, it is without specific meaning until
we provide it with a function to act upon. Furthermore, it does not “multiply” T; rather, it
is an instruction to differentiate what follows. To be precise, then, we should say that $\nabla$ is
a vector operator that acts upon T, not a vector that multiplies T.

With this qualification, though, $\nabla$ mimics the behavior of an ordinary vector in virtually
every way; almost anything that can be done with other vectors can also be done with $\nabla$, if
we merely translate “multiply” by “act upon.” So by all means take the vector appearance
of $\nabla$ seriously: it is a marvelous piece of notational simplification, as you will appreciate if
you ever consult Maxwell’s original work on electromagnetism, written without the benefit
of $\nabla$.

Now an ordinary vector $\mathbf{A}$ can multiply in three ways:

1. Multiply a scalar a: $\mathbf{A}a$;

2. Multiply another vector $\mathbf{B}$, via the dot product: $\mathbf{A}\cdot\mathbf{B}$;

3. Multiply another vector via the cross product: $\mathbf{A}\times\mathbf{B}$.

Correspondingly, there are three ways the operator $\nabla$ can act:

1. On a scalar function T: $\nabla T$ (the gradient);

2. On a vector function $\mathbf{v}$, via the dot product: $\nabla\cdot\mathbf{v}$ (the divergence);

3. On a vector function $\mathbf{v}$, via the cross product: $\nabla\times\mathbf{v}$ (the curl).

We have already discussed the gradient. In the following sections we examine the other
two vector derivatives: divergence and curl.
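1.2.4 The Divergence

From the definition of $\nabla$ we construct the divergence:

$$\begin{aligned}
\nabla\cdot\mathbf{v} &= \left(\hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}\right)\cdot\left(v_x\hat{\mathbf{x}} + v_y\hat{\mathbf{y}} + v_z\hat{\mathbf{z}}\right) \\
&= \frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z}.
\end{aligned} \tag{1.40}$$

Observe that the divergence of a vector function $\mathbf{v}$ is itself a scalar $\nabla\cdot\mathbf{v}$. (You can’t have
the divergence of a scalar: that’s meaningless.)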

Geometrical Interpretation: The name divergence is well chosen, for $\nabla\cdot\mathbf{v}$ is a measure
of how much the vector $\mathbf{v}$ spreads out (diverges) from the point in question. For example,
the vector function in Fig. 1.18a has a large (positive) divergence (if the arrows pointed in,
it would be a large negative divergence), the function in Fig. 1.18b has zero divergence, and
the function in Fig. 1.18c again has a positive divergence. (Please understand that $\mathbf{v}$ here is
a function: there’s a different vector associated with every point in space. In the diagrams,
of course, I can only draw the arrows at a few representative locations.) Imagine standing
at the edge of a pond. Sprinkle some sawdust or pine needles on the surface. If the material
spreads out, then you dropped it at a point of positive divergence; if it collects together,
you dropped it at a point of negative divergence. (The vector function $\mathbf{v}$ in this model is the
velocity of the water. This is a two-dimensional example, but it helps give one a “feel” for
what the divergence means. A point of positive divergence is a source, or “faucet”; a point
of negative divergence is a sink, or “drain.”)

Figure 1.18 [panels (a), (b), (c)]

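Example 1.4

Suppose the functions in Fig. 1.18 are $\mathbf{v}_a = \mathbf{r} = x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}$, $\mathbf{v}_b = \hat{\mathbf{z}}$, and $\mathbf{v}_c = z\,\hat{\mathbf{z}}$.
Calculate their divergences.

Solution:

$$\nabla\cdot\mathbf{v}_a = \frac{\partial}{\partial x}(x) + \frac{\partial}{\partial y}(y) + \frac{\partial}{\partial z}(z) = 1 + 1 + 1 = 3.$$

As anticipated, this function has a positive divergence.

$$\nabla\cdot\mathbf{v}_b = \frac{\partial}{\partial x}(0) + \frac{\partial}{\partial y}(0) + \frac{\partial}{\partial z}(1) = 0 + 0 + 0 = 0,$$

as expected.

$$\nabla\cdot\mathbf{v}_c = \frac{\partial}{\partial x}(0) + \frac{\partial}{\partial y}(0) + \frac{\partial}{\partial z}(z) = 0 + 0 + 1 = 1.$$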
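The three divergences of Example 1.4 can be reproduced numerically from the definition (1.40). This Python sketch is only an illustration: the finite-difference helper `divergence`, its step size, and the sample point are assumptions, not something the text provides.

```python
def divergence(v, p, h=1e-5):
    """Central-difference estimate of dv_x/dx + dv_y/dy + dv_z/dz at the point p."""
    total = 0.0
    for i in range(3):
        fwd, bwd = list(p), list(p)
        fwd[i] += h
        bwd[i] -= h
        total += (v(*fwd)[i] - v(*bwd)[i]) / (2 * h)
    return total

# The three functions of Example 1.4, as (v_x, v_y, v_z) tuples
v_a = lambda x, y, z: (x, y, z)
v_b = lambda x, y, z: (0.0, 0.0, 1.0)
v_c = lambda x, y, z: (0.0, 0.0, z)

p = (0.3, -1.1, 2.4)  # arbitrary sample point; these divergences are the same everywhere
print(divergence(v_a, p), divergence(v_b, p), divergence(v_c, p))
```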



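Problem 1.15 Calculate the divergence of the following vector functions:

(a) $\mathbf{v}_a = x^2\,\hat{\mathbf{x}} + 3xz^2\,\hat{\mathbf{y}} - 2xz\,\hat{\mathbf{z}}$.

(b) $\mathbf{v}_b = xy\,\hat{\mathbf{x}} + 2yz\,\hat{\mathbf{y}} + 3zx\,\hat{\mathbf{z}}$.

(c) $\mathbf{v}_c = y^2\,\hat{\mathbf{x}} + (2xy + z^2)\,\hat{\mathbf{y}} + 2yz\,\hat{\mathbf{z}}$.

Problem 1.16 Sketch the vector function

$$\mathbf{v} = \frac{\hat{\mathbf{r}}}{r^2},$$

and compute its divergence. The answer may surprise you... can you explain it?

Problem 1.17 In two dimensions, show that the divergence transforms as a scalar under rotations.
[Hint: Use Eq. 1.29 to determine $\bar{v}_y$ and $\bar{v}_z$, and the method of Prob. 1.14 to calculate
the derivatives. Your aim is to show that $\partial\bar{v}_y/\partial\bar{y} + \partial\bar{v}_z/\partial\bar{z} = \partial v_y/\partial y + \partial v_z/\partial z$.]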
1.2.5 The Curl

From the definition of $\nabla$ we construct the curl:

$$\begin{aligned}
\nabla\times\mathbf{v} &= \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ v_x & v_y & v_z \end{vmatrix} \\
&= \hat{\mathbf{x}}\left(\frac{\partial v_z}{\partial y} - \frac{\partial v_y}{\partial z}\right) + \hat{\mathbf{y}}\left(\frac{\partial v_x}{\partial z} - \frac{\partial v_z}{\partial x}\right) + \hat{\mathbf{z}}\left(\frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}\right).
\end{aligned} \tag{1.41}$$

Notice that the curl of a vector function $\mathbf{v}$ is, like any cross product, a vector. (You cannot
have the curl of a scalar; that’s meaningless.)

Geometrical Interpretation: The name curl is also well chosen, for $\nabla\times\mathbf{v}$ is a measure
of how much the vector $\mathbf{v}$ “curls around” the point in question. Thus the three functions in
Fig. 1.18 all have zero curl (as you can easily check for yourself), whereas the functions
in Fig. 1.19 have a substantial curl, pointing in the z-direction, as the natural right-hand
rule would suggest. Imagine (again) you are standing at the edge of a pond. Float a small
paddlewheel (a cork with toothpicks pointing out radially would do); if it starts to rotate,
then you placed it at a point of nonzero curl. A whirlpool would be a region of large curl.




Figure 1.19 



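Example 1.5

Suppose the function sketched in Fig. 1.19a is $\mathbf{v}_a = -y\,\hat{\mathbf{x}} + x\,\hat{\mathbf{y}}$, and that in Fig. 1.19b is
$\mathbf{v}_b = x\,\hat{\mathbf{y}}$. Calculate their curls.

Solution:

$$\nabla\times\mathbf{v}_a = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ -y & x & 0 \end{vmatrix} = 2\hat{\mathbf{z}},$$

and

$$\nabla\times\mathbf{v}_b = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ 0 & x & 0 \end{vmatrix} = \hat{\mathbf{z}}.$$

As expected, these curls point in the +z direction. (Incidentally, they both have zero divergence,
as you might guess from the pictures: nothing is “spreading out”... it just “curls around.”)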
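The curls in Example 1.5 can be verified numerically from Eq. 1.41. The Python sketch below is illustrative only; the helpers `partial` and `curl`, the step size, and the sample point are assumptions introduced for this check.

```python
def partial(v, comp, var, p, h=1e-5):
    """Central-difference estimate of d(v_comp)/d(x_var) at p (indices: 0 = x, 1 = y, 2 = z)."""
    fwd, bwd = list(p), list(p)
    fwd[var] += h
    bwd[var] -= h
    return (v(*fwd)[comp] - v(*bwd)[comp]) / (2 * h)

def curl(v, p):
    """The three components of Eq. 1.41, evaluated at the point p."""
    X, Y, Z = 0, 1, 2
    return (partial(v, Z, Y, p) - partial(v, Y, Z, p),
            partial(v, X, Z, p) - partial(v, Z, X, p),
            partial(v, Y, X, p) - partial(v, X, Y, p))

# The two functions of Example 1.5
v_a = lambda x, y, z: (-y, x, 0.0)
v_b = lambda x, y, z: (0.0, x, 0.0)

p = (0.7, -0.2, 1.5)  # arbitrary sample point; these curls are uniform
print(curl(v_a, p), curl(v_b, p))
```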



Problem 1.18 Calculate the curls of the vector functions in Prob. 1.15. 

Problem 1.19 Construct a vector function that has zero divergence and zero curl everywhere.
(A constant will do the job, of course, but make it something a little more interesting than
that!)



1.2.6 Product Rules 



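The calculation of ordinary derivatives is facilitated by a number of general rules, such as
the sum rule:

$$\frac{d}{dx}(f + g) = \frac{df}{dx} + \frac{dg}{dx},$$

the rule for multiplying by a constant:

$$\frac{d}{dx}(kf) = k\frac{df}{dx},$$

the product rule:

$$\frac{d}{dx}(fg) = f\frac{dg}{dx} + g\frac{df}{dx},$$

and the quotient rule:

$$\frac{d}{dx}\left(\frac{f}{g}\right) = \frac{g\dfrac{df}{dx} - f\dfrac{dg}{dx}}{g^2}.$$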

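Similar relations hold for the vector derivatives. Thus,

$$\nabla(f + g) = \nabla f + \nabla g, \qquad \nabla\cdot(\mathbf{A} + \mathbf{B}) = (\nabla\cdot\mathbf{A}) + (\nabla\cdot\mathbf{B}),$$

$$\nabla\times(\mathbf{A} + \mathbf{B}) = (\nabla\times\mathbf{A}) + (\nabla\times\mathbf{B}),$$

and

$$\nabla(kf) = k\nabla f, \qquad \nabla\cdot(k\mathbf{A}) = k(\nabla\cdot\mathbf{A}), \qquad \nabla\times(k\mathbf{A}) = k(\nabla\times\mathbf{A}),$$

as you can check for yourself. The product rules are not quite so simple. There are two
ways to construct a scalar as the product of two functions:

$fg$ (product of two scalar functions),

$\mathbf{A}\cdot\mathbf{B}$ (dot product of two vector functions),

and two ways to make a vector:

$f\mathbf{A}$ (scalar times vector),

$\mathbf{A}\times\mathbf{B}$ (cross product of two vectors).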
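Accordingly, there are six product rules, two for gradients:

(i) $\nabla(fg) = f\nabla g + g\nabla f$,

(ii) $\nabla(\mathbf{A}\cdot\mathbf{B}) = \mathbf{A}\times(\nabla\times\mathbf{B}) + \mathbf{B}\times(\nabla\times\mathbf{A}) + (\mathbf{A}\cdot\nabla)\mathbf{B} + (\mathbf{B}\cdot\nabla)\mathbf{A}$,

two for divergences:

(iii) $\nabla\cdot(f\mathbf{A}) = f(\nabla\cdot\mathbf{A}) + \mathbf{A}\cdot(\nabla f)$,

(iv) $\nabla\cdot(\mathbf{A}\times\mathbf{B}) = \mathbf{B}\cdot(\nabla\times\mathbf{A}) - \mathbf{A}\cdot(\nabla\times\mathbf{B})$,

and two for curls:

(v) $\nabla\times(f\mathbf{A}) = f(\nabla\times\mathbf{A}) - \mathbf{A}\times(\nabla f)$,

(vi) $\nabla\times(\mathbf{A}\times\mathbf{B}) = (\mathbf{B}\cdot\nabla)\mathbf{A} - (\mathbf{A}\cdot\nabla)\mathbf{B} + \mathbf{A}(\nabla\cdot\mathbf{B}) - \mathbf{B}(\nabla\cdot\mathbf{A})$.

You will be using these product rules so frequently that I have put them on the inside front
cover for easy reference. The proofs come straight from the product rule for ordinary
derivatives. For instance,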



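$$\begin{aligned}
\nabla\cdot(f\mathbf{A}) &= \frac{\partial}{\partial x}(fA_x) + \frac{\partial}{\partial y}(fA_y) + \frac{\partial}{\partial z}(fA_z) \\
&= \left(\frac{\partial f}{\partial x}A_x + f\frac{\partial A_x}{\partial x}\right) + \left(\frac{\partial f}{\partial y}A_y + f\frac{\partial A_y}{\partial y}\right) + \left(\frac{\partial f}{\partial z}A_z + f\frac{\partial A_z}{\partial z}\right) \\
&= (\nabla f)\cdot\mathbf{A} + f(\nabla\cdot\mathbf{A}).
\end{aligned}$$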

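It is also possible to formulate three quotient rules:

$$\nabla\left(\frac{f}{g}\right) = \frac{g\nabla f - f\nabla g}{g^2},$$

$$\nabla\cdot\left(\frac{\mathbf{A}}{g}\right) = \frac{g(\nabla\cdot\mathbf{A}) - \mathbf{A}\cdot(\nabla g)}{g^2},$$

$$\nabla\times\left(\frac{\mathbf{A}}{g}\right) = \frac{g(\nabla\times\mathbf{A}) + \mathbf{A}\times(\nabla g)}{g^2}.$$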

However, since these can be obtained quickly from the corresponding product rules, I 
haven’t bothered to put them on the inside front cover. 
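A product rule can be spot-checked numerically before (or after) proving it. The Python sketch below tests rule (v) at one point with central differences. Everything specific in it is an assumption made for the illustration: the helper names, the scalar function f, the vector function A, and the sample point. Agreement at one point is of course a sanity check, not a proof.

```python
def partial(F, comp, var, p, h=1e-5):
    """Central-difference estimate of d(F_comp)/d(x_var) at p; F returns a 3-tuple."""
    fwd, bwd = list(p), list(p)
    fwd[var] += h
    bwd[var] -= h
    return (F(*fwd)[comp] - F(*bwd)[comp]) / (2 * h)

def curl(F, p):
    return (partial(F, 2, 1, p) - partial(F, 1, 2, p),
            partial(F, 0, 2, p) - partial(F, 2, 0, p),
            partial(F, 1, 0, p) - partial(F, 0, 1, p))

def grad(f, p):
    # Wrap the scalar as a 3-tuple so that `partial` can be reused unchanged.
    wrapped = lambda x, y, z: (f(x, y, z),) * 3
    return tuple(partial(wrapped, i, i, p) for i in range(3))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

# Hypothetical test functions
f = lambda x, y, z: x * x * y + z
A = lambda x, y, z: (y * z, x * z * z, x * y)
fA = lambda x, y, z: tuple(f(x, y, z) * c for c in A(x, y, z))

p = (0.5, -1.2, 0.8)
lhs = curl(fA, p)                                         # curl of (f A)
curl_A = curl(A, p)
A_cross_grad_f = cross(A(*p), grad(f, p))
rhs = tuple(f(*p) * c - d for c, d in zip(curl_A, A_cross_grad_f))  # f (curl A) - A x (grad f)
print(lhs, rhs)
```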
Problem 1.20 Prove product rules (i), (iv), and (v).

Problem 1.21

(a) If $\mathbf{A}$ and $\mathbf{B}$ are two vector functions, what does the expression $(\mathbf{A}\cdot\nabla)\mathbf{B}$ mean? (That is,
what are its x, y, and z components in terms of the Cartesian components of $\mathbf{A}$, $\mathbf{B}$, and $\nabla$?)

(b) Compute $(\hat{\mathbf{r}}\cdot\nabla)\hat{\mathbf{r}}$, where $\hat{\mathbf{r}}$ is the unit vector defined in Eq. 1.21.

(c) For the functions in Prob. 1.15, evaluate $(\mathbf{v}_a\cdot\nabla)\mathbf{v}_b$.

Problem 1.22 (For masochists only.) Prove product rules (ii) and (vi). Refer to Prob. 1.21 for
the definition of $(\mathbf{A}\cdot\nabla)\mathbf{B}$.

Problem 1.23 Derive the three quotient rules.

Problem 1.24

(a) Check product rule (iv) (by calculating each term separately) for the functions

$$\mathbf{A} = x\,\hat{\mathbf{x}} + 2y\,\hat{\mathbf{y}} + 3z\,\hat{\mathbf{z}}; \qquad \mathbf{B} = 3y\,\hat{\mathbf{x}} - 2x\,\hat{\mathbf{y}}.$$

(b) Do the same for product rule (ii). 

(c) The same for rule (vi). 



1.2.7 Second Derivatives 



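The gradient, the divergence, and the curl are the only first derivatives we can make with
$\nabla$; by applying $\nabla$ twice we can construct five species of second derivatives. The gradient
$\nabla T$ is a vector, so we can take the divergence and curl of it:

(1) Divergence of gradient: $\nabla\cdot(\nabla T)$.

(2) Curl of gradient: $\nabla\times(\nabla T)$.

The divergence $\nabla\cdot\mathbf{v}$ is a scalar; all we can do is take its gradient:

(3) Gradient of divergence: $\nabla(\nabla\cdot\mathbf{v})$.

The curl $\nabla\times\mathbf{v}$ is a vector, so we can take its divergence and curl:

(4) Divergence of curl: $\nabla\cdot(\nabla\times\mathbf{v})$.

(5) Curl of curl: $\nabla\times(\nabla\times\mathbf{v})$.

This exhausts the possibilities, and in fact not all of them give anything new. Let’s
consider them one at a time:

$$\begin{aligned}
(1)\quad \nabla\cdot(\nabla T) &= \left(\hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}\right)\cdot\left(\frac{\partial T}{\partial x}\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\hat{\mathbf{z}}\right) \\
&= \frac{\partial^2 T}{\partial x^2} + \frac{\partial^2 T}{\partial y^2} + \frac{\partial^2 T}{\partial z^2}.
\end{aligned} \tag{1.42}$$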
This object, which we write $\nabla^2 T$ for short, is called the Laplacian of T; we shall be
studying it in great detail later on. Notice that the Laplacian of a scalar T is a scalar.
Occasionally, we shall speak of the Laplacian of a vector, $\nabla^2\mathbf{v}$. By this we mean a vector
quantity whose x-component is the Laplacian of $v_x$, and so on:³

$$\nabla^2\mathbf{v} \equiv (\nabla^2 v_x)\hat{\mathbf{x}} + (\nabla^2 v_y)\hat{\mathbf{y}} + (\nabla^2 v_z)\hat{\mathbf{z}}. \tag{1.43}$$

This is nothing more than a convenient extension of the meaning of $\nabla^2$.

(2) The curl of a gradient is always zero:

$$\nabla\times(\nabla T) = 0. \tag{1.44}$$

This is an important fact, which we shall use repeatedly; you can easily prove it from the
definition of $\nabla$, Eq. 1.39. Beware: You might think Eq. 1.44 is “obviously” true. Isn’t it
just $(\nabla\times\nabla)T$, and isn’t the cross product of any vector (in this case, $\nabla$) with itself always
zero? This reasoning is suggestive but not quite conclusive, since $\nabla$ is an operator and does
not “multiply” in the usual way. The proof of Eq. 1.44, in fact, hinges on the equality of
cross derivatives:

$$\frac{\partial}{\partial x}\left(\frac{\partial T}{\partial y}\right) = \frac{\partial}{\partial y}\left(\frac{\partial T}{\partial x}\right). \tag{1.45}$$



If you think I’m being fussy, test your intuition on this one:

$$(\nabla T)\times(\nabla S).$$

Is that always zero? (It would be, of course, if you replaced the $\nabla$’s by an ordinary vector.)

(3) $\nabla(\nabla\cdot\mathbf{v})$ for some reason seldom occurs in physical applications, and it has not been
given any special name of its own; it’s just the gradient of the divergence. Notice that
$\nabla(\nabla\cdot\mathbf{v})$ is not the same as the Laplacian of a vector: $\nabla^2\mathbf{v} = (\nabla\cdot\nabla)\mathbf{v} \neq \nabla(\nabla\cdot\mathbf{v})$.

(4) The divergence of a curl, like the curl of a gradient, is always zero:

$$\nabla\cdot(\nabla\times\mathbf{v}) = 0. \tag{1.46}$$

You can prove this for yourself. (Again, there is a fraudulent short-cut proof, using the
vector identity $\mathbf{A}\cdot(\mathbf{B}\times\mathbf{C}) = (\mathbf{A}\times\mathbf{B})\cdot\mathbf{C}$.)

(5) As you can check from the definition of $\nabla$:

$$\nabla\times(\nabla\times\mathbf{v}) = \nabla(\nabla\cdot\mathbf{v}) - \nabla^2\mathbf{v}. \tag{1.47}$$

So curl-of-curl gives nothing new; the first term is just number (3) and the second is the
Laplacian (of a vector). (In fact, Eq. 1.47 is often used to define the Laplacian of a vector,
in preference to Eq. 1.43, which makes specific reference to Cartesian coordinates.)



Really, then, there are just two kinds of second derivatives: the Laplacian (which is
of fundamental importance) and the gradient-of-divergence (which we seldom encounter).
We could go through a similar ritual to work out third derivatives, but fortunately second
derivatives suffice for practically all physical applications.

³ In curvilinear coordinates, where the unit vectors themselves depend on position, they too must be differentiated
(see Sect. 1.4.1).

A final word on vector differential calculus: It all flows from the operator $\nabla$, and from
taking seriously its vector character. Even if you remembered only the definition of $\nabla$, you
should be able, in principle, to reconstruct all the rest.
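The statements about second derivatives lend themselves to a numerical spot check. The Python sketch below composes finite-difference versions of gradient, divergence, and curl, then evaluates curl-of-gradient, divergence-of-curl, and the Laplacian at one point. The helper names, the step size, and the test functions T and v are assumptions chosen for this illustration; none of them come from the text or its problems.

```python
import math

def d(f, i, p, h=1e-3):
    """Central difference of the scalar function f along coordinate i at the point p."""
    fwd, bwd = list(p), list(p)
    fwd[i] += h
    bwd[i] -= h
    return (f(fwd) - f(bwd)) / (2 * h)

def grad(T):
    return lambda p: tuple(d(T, i, p) for i in range(3))

def div(v):
    return lambda p: sum(d(lambda q, i=i: v(q)[i], i, p) for i in range(3))

def curl(v):
    def at(p):
        dv = lambda comp, var: d(lambda q: v(q)[comp], var, p)
        return (dv(2, 1) - dv(1, 2), dv(0, 2) - dv(2, 0), dv(1, 0) - dv(0, 1))
    return at

# Hypothetical test functions of the point p = (x, y, z)
T = lambda p: p[0] ** 2 * p[1] + p[0] * math.sin(p[2])
v = lambda p: (p[1] * p[2], math.cos(p[0]) * p[2], p[0] * p[1] ** 2)

point = (0.3, -0.7, 1.1)
curl_of_grad = curl(grad(T))(point)        # expected to vanish (Eq. 1.44)
div_of_curl = div(curl(v))(point)          # expected to vanish (Eq. 1.46)
laplacian_T = div(grad(T))(point)          # divergence of gradient (Eq. 1.42)
exact_laplacian = 2 * point[1] - point[0] * math.sin(point[2])
print(curl_of_grad, div_of_curl, laplacian_T, exact_laplacian)
```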



Problem 1.25 Calculate the Laplacian of the following functions: 

(a) $T_a = x^2 + 2xy + 3z + 4$.

(b) $T_b = \sin x \sin y \sin z$.

(c) $T_c = e^{-5x}\sin 4y \cos 3z$.

(d) $\mathbf{v} = x^2\,\hat{\mathbf{x}} + 3xz^2\,\hat{\mathbf{y}} - 2xz\,\hat{\mathbf{z}}$.

Problem 1.26 Prove that the divergence of a curl is always zero. Check it for function $\mathbf{v}_a$ in
Prob. 1.15.

Problem 1.27 Prove that the curl of a gradient is always zero. Check it for function (b) in
Prob. 1.11.



1.3 Integral Calculus



1.3.1 Line, Surface, and Volume Integrals 

In electrodynamics we encounter several different kinds of integrals, among which the most
important are line (or path) integrals, surface integrals (or flux), and volume integrals.

(a) Line Integrals. A line integral is an expression of the form

$$\int_{\mathbf{a}\,\mathcal{P}}^{\mathbf{b}} \mathbf{v}\cdot d\mathbf{l}, \tag{1.48}$$

where $\mathbf{v}$ is a vector function, $d\mathbf{l}$ is the infinitesimal displacement vector (Eq. 1.22), and the
integral is to be carried out along a prescribed path $\mathcal{P}$ from point $\mathbf{a}$ to point $\mathbf{b}$ (Fig. 1.20). If
the path in question forms a closed loop (that is, if $\mathbf{b} = \mathbf{a}$), I shall put a circle on the integral
sign:

$$\oint \mathbf{v}\cdot d\mathbf{l}. \tag{1.49}$$

At each point on the path we take the dot product of $\mathbf{v}$ (evaluated at that point) with the
displacement $d\mathbf{l}$ to the next point on the path. To a physicist, the most familiar example of
a line integral is the work done by a force $\mathbf{F}$: $W = \int \mathbf{F}\cdot d\mathbf{l}$.

Ordinarily, the value of a line integral depends critically on the particular path taken 
from a to b, but there is an important special class of vector functions for which the line 
integral is independent of the path, and is determined entirely by the end points. It will be 
our business in due course to characterize this special class of vectors. (A force that has 
this property is called conservative.) 
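Figure 1.20

Figure 1.21

Example 1.6

Calculate the line integral of the function $\mathbf{v} = y^2\,\hat{\mathbf{x}} + 2x(y + 1)\,\hat{\mathbf{y}}$ from the point $\mathbf{a} = (1, 1, 0)$
to the point $\mathbf{b} = (2, 2, 0)$, along the paths (1) and (2) in Fig. 1.21. What is $\oint \mathbf{v}\cdot d\mathbf{l}$ for the loop
that goes from $\mathbf{a}$ to $\mathbf{b}$ along (1) and returns to $\mathbf{a}$ along (2)?

Solution: As always, $d\mathbf{l} = dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}$. Path (1) consists of two parts. Along the
“horizontal” segment dy = dz = 0, so

(i) $d\mathbf{l} = dx\,\hat{\mathbf{x}}$, $y = 1$, $\mathbf{v}\cdot d\mathbf{l} = y^2\,dx = dx$, so $\int \mathbf{v}\cdot d\mathbf{l} = \int_1^2 dx = 1$.

On the “vertical” stretch dx = dz = 0, so

(ii) $d\mathbf{l} = dy\,\hat{\mathbf{y}}$, $x = 2$, $\mathbf{v}\cdot d\mathbf{l} = 2x(y + 1)\,dy = 4(y + 1)\,dy$, so

$$\int \mathbf{v}\cdot d\mathbf{l} = 4\int_1^2 (y + 1)\,dy = 10.$$

By path (1), then,

$$\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{v}\cdot d\mathbf{l} = 1 + 10 = 11.$$

Meanwhile, on path (2) x = y, dx = dy, and dz = 0, so
$d\mathbf{l} = dx\,\hat{\mathbf{x}} + dx\,\hat{\mathbf{y}}$, $\mathbf{v}\cdot d\mathbf{l} = x^2\,dx + 2x(x + 1)\,dx = (3x^2 + 2x)\,dx$,
so

$$\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{v}\cdot d\mathbf{l} = \int_1^2 (3x^2 + 2x)\,dx = \left.(x^3 + x^2)\right|_1^2 = 10.$$

(The strategy here is to get everything in terms of one variable; I could just as well have
eliminated x in favor of y.)

For the loop that goes out (1) and back (2), then,

$$\oint \mathbf{v}\cdot d\mathbf{l} = 11 - 10 = 1.$$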
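The two path integrals of Example 1.6 can be reproduced by chopping each path into short straight steps and summing $\mathbf{v}\cdot d\mathbf{l}$. The Python sketch below is an illustration only; the parametrizations `path1` and `path2`, the helper `line_integral`, and the number of steps are assumptions made here, not part of the original solution.

```python
def line_integral(v, path, n=2000):
    """Sum v . dl over n short chords of path(t), 0 <= t <= 1, evaluating v at chord midpoints."""
    total = 0.0
    for k in range(n):
        p0 = path(k / n)
        p1 = path((k + 1) / n)
        mid = tuple((a + b) / 2 for a, b in zip(p0, p1))
        dl = tuple(b - a for a, b in zip(p0, p1))
        total += sum(vi * dli for vi, dli in zip(v(*mid), dl))
    return total

def v(x, y, z):
    """The vector function of Example 1.6."""
    return (y * y, 2 * x * (y + 1), 0.0)

def path1(t):
    """Path (1): from (1,1,0) horizontally to (2,1,0), then vertically to (2,2,0)."""
    if t <= 0.5:
        return (1 + 2 * t, 1.0, 0.0)
    return (2.0, 1 + 2 * (t - 0.5), 0.0)

def path2(t):
    """Path (2): the straight diagonal from (1,1,0) to (2,2,0)."""
    return (1 + t, 1 + t, 0.0)

I1 = line_integral(v, path1)
I2 = line_integral(v, path2)
print(I1, I2, I1 - I2)   # compare with 11, 10, and 1 from the worked solution
```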
(b) Surface Integrals. A surface integral is an expression of the form

$$\int_{\mathcal{S}} \mathbf{v}\cdot d\mathbf{a}, \tag{1.50}$$

where $\mathbf{v}$ is again some vector function, and $d\mathbf{a}$ is an infinitesimal patch of area, with direction
perpendicular to the surface (Fig. 1.22). There are, of course, two directions perpendicular
to any surface, so the sign of a surface integral is intrinsically ambiguous. If the surface is
closed (forming a “balloon”), in which case I shall again put a circle on the integral sign

$$\oint \mathbf{v}\cdot d\mathbf{a},$$

then tradition dictates that “outward” is positive, but for open surfaces it’s arbitrary. If $\mathbf{v}$
describes the flow of a fluid (mass per unit area per unit time), then $\int \mathbf{v}\cdot d\mathbf{a}$ represents the
total mass per unit time passing through the surface; hence the alternative name, “flux.”
Ordinarily, the value of a surface integral depends on the particular surface chosen, but
there is a special class of vector functions for which it is independent of the surface, and
is determined entirely by the boundary line. We shall soon be in a position to characterize
this special class.





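Example 1.7

Calculate the surface integral of $\mathbf{v} = 2xz\,\hat{\mathbf{x}} + (x + 2)\,\hat{\mathbf{y}} + y(z^2 - 3)\,\hat{\mathbf{z}}$ over five sides (excluding
the bottom) of the cubical box (side 2) in Fig. 1.23. Let “upward and outward” be the positive
direction, as indicated by the arrows.

Solution: Taking the sides one at a time:

(i) $x = 2$, $d\mathbf{a} = dy\,dz\,\hat{\mathbf{x}}$, $\mathbf{v}\cdot d\mathbf{a} = 2xz\,dy\,dz = 4z\,dy\,dz$, so

$$\int \mathbf{v}\cdot d\mathbf{a} = 4\int_0^2 dy \int_0^2 z\,dz = 16.$$

(ii) $x = 0$, $d\mathbf{a} = -dy\,dz\,\hat{\mathbf{x}}$, $\mathbf{v}\cdot d\mathbf{a} = -2xz\,dy\,dz = 0$, so

$$\int \mathbf{v}\cdot d\mathbf{a} = 0.$$

(iii) $y = 2$, $d\mathbf{a} = dx\,dz\,\hat{\mathbf{y}}$, $\mathbf{v}\cdot d\mathbf{a} = (x + 2)\,dx\,dz$, so

$$\int \mathbf{v}\cdot d\mathbf{a} = \int_0^2 (x + 2)\,dx \int_0^2 dz = 12.$$

(iv) $y = 0$, $d\mathbf{a} = -dx\,dz\,\hat{\mathbf{y}}$, $\mathbf{v}\cdot d\mathbf{a} = -(x + 2)\,dx\,dz$, so

$$\int \mathbf{v}\cdot d\mathbf{a} = -\int_0^2 (x + 2)\,dx \int_0^2 dz = -12.$$

(v) $z = 2$, $d\mathbf{a} = dx\,dy\,\hat{\mathbf{z}}$, $\mathbf{v}\cdot d\mathbf{a} = y(z^2 - 3)\,dx\,dy = y\,dx\,dy$, so

$$\int \mathbf{v}\cdot d\mathbf{a} = \int_0^2 dx \int_0^2 y\,dy = 4.$$

Evidently the total flux is

$$\int_{\text{surface}} \mathbf{v}\cdot d\mathbf{a} = 16 + 0 + 12 - 12 + 4 = 20.$$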
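The flux in Example 1.7 can also be estimated by tiling each face with small squares and summing $\mathbf{v}\cdot d\mathbf{a}$. This Python sketch is illustrative; the helper `flux_through_face`, the way each face is parametrized, and the grid size are assumptions introduced for the check.

```python
def v(x, y, z):
    """The vector function of Example 1.7."""
    return (2 * x * z, x + 2, y * (z * z - 3))

def flux_through_face(v, to_point, normal, side=2.0, n=100):
    """Midpoint-rule estimate of the flux through one flat square face.

    to_point(s, t) maps face coordinates (0 <= s, t <= side) to a point (x, y, z);
    normal is the constant unit vector giving the positive direction of da.
    """
    h = side / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            s, t = (i + 0.5) * h, (j + 0.5) * h
            field = v(*to_point(s, t))
            total += sum(f * nn for f, nn in zip(field, normal)) * h * h
    return total

# Five faces of the box (bottom excluded), each with its "upward and outward" normal
faces = [
    (lambda s, t: (2.0, s, t), (1, 0, 0)),    # (i)   x = 2
    (lambda s, t: (0.0, s, t), (-1, 0, 0)),   # (ii)  x = 0
    (lambda s, t: (s, 2.0, t), (0, 1, 0)),    # (iii) y = 2
    (lambda s, t: (s, 0.0, t), (0, -1, 0)),   # (iv)  y = 0
    (lambda s, t: (s, t, 2.0), (0, 0, 1)),    # (v)   z = 2 (top)
]
per_face = [flux_through_face(v, to_point, normal) for to_point, normal in faces]
total = sum(per_face)
print(per_face, total)   # compare with 16, 0, 12, -12, 4 and the total 20
```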



(c) Volume Integrals. A volume integral is an expression of the form

$$\int_{\mathcal{V}} T\,d\tau, \tag{1.51}$$

where T is a scalar function and $d\tau$ is an infinitesimal volume element. In Cartesian
coordinates,

$$d\tau = dx\,dy\,dz. \tag{1.52}$$

For example, if T is the density of a substance (which might vary from point to point), then
the volume integral would give the total mass. Occasionally we shall encounter volume
integrals of vector functions:

$$\int \mathbf{v}\,d\tau = \int (v_x\hat{\mathbf{x}} + v_y\hat{\mathbf{y}} + v_z\hat{\mathbf{z}})\,d\tau = \hat{\mathbf{x}}\int v_x\,d\tau + \hat{\mathbf{y}}\int v_y\,d\tau + \hat{\mathbf{z}}\int v_z\,d\tau; \tag{1.53}$$

because the unit vectors are constants, they come outside the integral.

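Example 1.8

Calculate the volume integral of $T = xyz^2$ over the prism in Fig. 1.24.

Solution: You can do the three integrals in any order. Let’s do x first: it runs from 0 to (1 − y);
then y (it goes from 0 to 1); and finally z (0 to 3):

$$\int T\,d\tau = \int_0^3 z^2\left\{\int_0^1 y\left[\int_0^{1-y} x\,dx\right]dy\right\}dz = \frac{1}{2}\int_0^3 z^2\,dz \int_0^1 (1 - y)^2\,y\,dy = \frac{1}{2}(9)\left(\frac{1}{12}\right) = \frac{3}{8}.$$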
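The same volume integral can be estimated by brute force, summing $T\,d\tau$ over small cells with the limits quoted in the solution (x from 0 to 1 − y, y from 0 to 1, z from 0 to 3). The Python sketch below is an illustration; the function name `prism_integral` and the grid size are assumptions, and the result is only approximate.

```python
def prism_integral(T, n=60):
    """Midpoint-rule estimate of the integral of T over the prism of Example 1.8."""
    total = 0.0
    dy, dz = 1.0 / n, 3.0 / n
    for j in range(n):
        y = (j + 0.5) * dy
        dx = (1.0 - y) / n              # x runs from 0 to 1 - y at this value of y
        for i in range(n):
            x = (i + 0.5) * dx
            for k in range(n):
                z = (k + 0.5) * dz
                total += T(x, y, z) * dx * dy * dz
    return total

T = lambda x, y, z: x * y * z ** 2
estimate = prism_integral(T)
print(estimate)   # should lie close to 3/8 = 0.375
```

Refining the grid (larger n) brings the estimate closer to 3/8, at a cost that grows as n cubed.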
Figure 1.24



Problem 1.28 Calculate the line integral of the function $\mathbf{v} = x^2\,\hat{\mathbf{x}} + 2yz\,\hat{\mathbf{y}} + y^2\,\hat{\mathbf{z}}$ from the
origin to the point (1,1,1) by three different routes:

(a) (0,0,0) → (1,0,0) → (1,1,0) → (1,1,1);

(b) (0,0,0) → (0,0,1) → (0,1,1) → (1,1,1);

(c) The direct straight line.

(d) What is the line integral around the closed loop that goes out along path (a) and back along
path (b)?

Problem 1.29 Calculate the surface integral of the function in Ex. 1.7, over the bottom of the
box. For consistency, let “upward” be the positive direction. Does the surface integral depend
only on the boundary line for this function? What is the total flux over the closed surface of the
box (including the bottom)? [Note: For the closed surface the positive direction is “outward,”
and hence “down,” for the bottom face.]

Problem 1.30 Calculate the volume integral of the function $T = z^2$ over the tetrahedron with
corners at (0,0,0), (1,0,0), (0,1,0), and (0,0,1).



1.3.2 The Fundamental Theorem of Calculus 

Suppose f(x) is a function of one variable. The fundamental theorem of calculus states:

$$\int_a^b \frac{df}{dx}\,dx = f(b) - f(a). \tag{1.54}$$

In case this doesn’t look familiar, let’s write it another way:

$$\int_a^b F(x)\,dx = f(b) - f(a),$$

where df/dx = F(x). The fundamental theorem tells you how to integrate F(x): you
think up a function f(x) whose derivative is equal to F.
Geometrical Interpretation: According to Eq. 1.33, df = (df/dx)dx is the infinitesimal
change in f when you go from (x) to (x + dx). The fundamental theorem (1.54) says
that if you chop the interval from a to b (Fig. 1.25) into many tiny pieces, dx, and add up
the increments df from each little piece, the result is (not surprisingly) equal to the total
change in f: f(b) − f(a). In other words, there are two ways to determine the total change
in the function: either subtract the values at the ends or go step-by-step, adding up all the
tiny increments as you go. You’ll get the same answer either way.

Notice the basic format of the fundamental theorem: the integral of a derivative over
an interval is given by the value of the function at the end points (boundaries). In vector
calculus there are three species of derivative (gradient, divergence, and curl), and each has
its own “fundamental theorem,” with essentially the same format. I don’t plan to prove
these theorems here; rather, I shall explain what they mean, and try to make them plausible.
Proofs are given in Appendix A.




Figure 1.25 



Figure 1.26 



1.3.3 The Fundamental Theorem for Gradients 

Suppose we have a scalar function of three variables $T(x, y, z)$. Starting at point $\mathbf{a}$, we 
move a small distance $d\mathbf{l}_1$ (Fig. 1.26). According to Eq. 1.37, the function $T$ will change 
by an amount 

$$dT = (\nabla T)\cdot d\mathbf{l}_1.$$

Now we move a little further, by an additional small displacement $d\mathbf{l}_2$; the incremental 
change in $T$ will be $(\nabla T)\cdot d\mathbf{l}_2$. In this manner, proceeding by infinitesimal steps, we make 
the journey to point $\mathbf{b}$. At each step we compute the gradient of $T$ (at that point) and dot it 
into the displacement $d\mathbf{l}$; this gives us the change in $T$. Evidently the total change in $T$ 
in going from $\mathbf{a}$ to $\mathbf{b}$ along the path $\mathcal{P}$ selected is 

$$\int_{\mathbf{a},\,\mathcal{P}}^{\mathbf{b}} (\nabla T)\cdot d\mathbf{l} = T(\mathbf{b}) - T(\mathbf{a}). \tag{1.55}$$

This is called the fundamental theorem for gradients; like the “ordinary” fundamental 
theorem, it says that the integral (here a line integral) of a derivative (here the gradient) is 
given by the value of the function at the boundaries ($\mathbf{a}$ and $\mathbf{b}$). 

Geometrical Interpretation: Suppose you wanted to determine the height of the Eiffel 
Tower. You could climb the stairs, using a ruler to measure the rise at each step, and adding 
them all up (that’s the left side of Eq. 1.55), or you could place altimeters at the top and 
the bottom, and subtract the two readings (that’s the right side); you should get the same 
answer either way (that’s the fundamental theorem). 

Incidentally, as we found in Ex. 1.6, line integrals ordinarily depend on the path taken 
from a to b. But the right side of Eq. 1.55 makes no reference to the path — only to the 
end points. Evidently, gradients have the special property that their line integrals are path 
independent: 

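Corollary 1: $\int_{\mathbf{a}}^{\mathbf{b}} (\nabla T)\cdot d\mathbf{l}$ is independent of the path taken from $\mathbf{a}$ to $\mathbf{b}$. 

Corollary 2: $\oint (\nabla T)\cdot d\mathbf{l} = 0$, since the beginning and end points 
are identical, and hence $T(\mathbf{b}) - T(\mathbf{a}) = 0$. 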



Example 1.9 

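Let $T = xy^2$, and take point $\mathbf{a}$ to be the origin (0, 0, 0) and $\mathbf{b}$ the point (2, 1, 0). Check the 
fundamental theorem for gradients. 

Solution: Although the integral is independent of path, we must pick a specific path in order 
to evaluate it. Let’s go out along the x axis (step i) and then up (step ii) (Fig. 1.27). As always, 
$d\mathbf{l} = dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}$; $\nabla T = y^2\,\hat{\mathbf{x}} + 2xy\,\hat{\mathbf{y}}$. 

(i) $y = 0$; $d\mathbf{l} = dx\,\hat{\mathbf{x}}$, $\nabla T\cdot d\mathbf{l} = y^2\,dx = 0$, so 

$$\int_{\mathrm{i}} \nabla T\cdot d\mathbf{l} = 0.$$

(ii) $x = 2$; $d\mathbf{l} = dy\,\hat{\mathbf{y}}$, $\nabla T\cdot d\mathbf{l} = 2xy\,dy = 4y\,dy$, so 

$$\int_{\mathrm{ii}} \nabla T\cdot d\mathbf{l} = \int_0^1 4y\,dy = 2y^2\Big|_0^1 = 2.$$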




Figure 1.27 

Evidently the total line integral is 2. Is this consistent with the fundamental theorem? Yes: 
$T(\mathbf{b}) - T(\mathbf{a}) = 2 - 0 = 2$. 

Now, just to convince you that the answer is independent of path, let me calculate the same 
integral along path iii (the straight line from $\mathbf{a}$ to $\mathbf{b}$): 

(iii) $y = \tfrac{1}{2}x$, $dy = \tfrac{1}{2}\,dx$, $\nabla T\cdot d\mathbf{l} = y^2\,dx + 2xy\,dy = \tfrac{3}{4}x^2\,dx$, so 

$$\int_{\mathrm{iii}} \nabla T\cdot d\mathbf{l} = \int_0^2 \tfrac{3}{4}x^2\,dx = \tfrac{1}{4}x^3\Big|_0^2 = 2.$$
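
The path independence claimed in Example 1.9 can also be checked by brute force. What follows is a rough numerical sketch, not part of the worked solution: the function names and the midpoint-rule discretization are my own assumptions, and the third path (a parabola from $\mathbf{a}$ to $\mathbf{b}$) is a hypothetical extra, added only to test a route the example does not use.

```python
def grad_T(x, y):
    """Gradient of T = x*y**2 in the z = 0 plane: (y^2, 2xy)."""
    return (y**2, 2 * x * y)

def line_integral(path, n=20_000):
    """Midpoint-rule estimate of the integral of grad(T) . dl along path(t), 0 <= t <= 1."""
    total = 0.0
    for i in range(n):
        x0, y0 = path(i / n)
        x1, y1 = path((i + 1) / n)
        gx, gy = grad_T(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += gx * (x1 - x0) + gy * (y1 - y0)   # (grad T) . dl for this small step
    return total

def path_i_ii(t):
    """Steps (i) and (ii): out along the x axis to (2, 0), then up to (2, 1)."""
    return (4 * t, 0.0) if t < 0.5 else (2.0, 2 * t - 1.0)

def path_iii(t):
    """Step (iii): the straight line y = x/2 from (0, 0) to (2, 1)."""
    return (2 * t, t)

def path_parabola(t):
    """A hypothetical extra path, y = x^2/4, with the same end points."""
    return (2 * t, t * t)

results = [line_integral(p) for p in (path_i_ii, path_iii, path_parabola)]
print(results)
```

Each entry should come out close to $T(\mathbf{b}) - T(\mathbf{a}) = 2$, whichever route is taken.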



Problem 1.31 Check the fundamental theorem for gradients, using $T = x^2 + 4xy + 2yz^2$, the 
points $\mathbf{a} = (0, 0, 0)$, $\mathbf{b} = (1, 1, 1)$, and the three paths in Fig. 1.28: 

(a) (0, 0, 0) → (1, 0, 0) → (1, 1, 0) → (1, 1, 1); 

(b) (0, 0, 0) → (0, 0, 1) → (0, 1, 1) → (1, 1, 1); 

(c) the parabolic path $z = x^2$; $y = x$. 






1.3.4 The Fundamental Theorem for Divergences 

The fundamental theorem for divergences states that: 

$$\int_{\mathcal{V}} (\nabla\cdot\mathbf{v})\,d\tau = \oint_{\mathcal{S}} \mathbf{v}\cdot d\mathbf{a}. \tag{1.56}$$



In honor, I suppose, of its great importance, this theorem has at least three special names: 
Gauss’s theorem, Green’s theorem, or, simply, the divergence theorem. Like the other 
“fundamental theorems,” it says that the integral of a derivative (in this case the divergence) 
over a region (in this case a volume) is equal to the value of the function at the boundary 
(in this case the surface that bounds the volume). Notice that the boundary term is itself 
an integral (specifically, a surface integral). This is reasonable: the “boundary” of a line is 
just two end points, but the boundary of a volume is a (closed) surface. 

Geometrical Interpretation: If v represents the flow of an incompressible fluid, then 
the flux of v (the right side of Eq. 1.56) is the total amount of fluid passing out through the 
surface, per unit time. Now, the divergence measures the “spreading out” of the vectors 
from a point — a place of high divergence is like a “faucet,” pouring out liquid. If we have 
lots of faucets in a region filled with incompressible fluid, an equal amount of liquid will 
be forced out through the boundaries of the region. In fact, there are two ways we could 
determine how much is being produced: (a) we could count up all the faucets, recording 
how much each puts out, or (b) we could go around the boundary, measuring the flow at 
each point, and add it all up. You get the same answer either way: 

$$\int (\text{faucets within the volume}) = \oint (\text{flow out through the surface}).$$

This, in essence, is what the divergence theorem says. 

Example 1.10 

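Check the divergence theorem using the function 

$$\mathbf{v} = y^2\,\hat{\mathbf{x}} + (2xy + z^2)\,\hat{\mathbf{y}} + (2yz)\,\hat{\mathbf{z}}$$

and the unit cube situated at the origin (Fig. 1.29). 

Solution: In this case 

$$\nabla\cdot\mathbf{v} = 2(x + y),$$

and 

$$\int_{\mathcal{V}} 2(x + y)\,d\tau = 2\int_0^1\!\int_0^1\!\int_0^1 (x + y)\,dx\,dy\,dz,$$

$$\int_0^1 (x + y)\,dx = \tfrac{1}{2} + y, \qquad \int_0^1 \left(\tfrac{1}{2} + y\right)dy = 1, \qquad \int_0^1 1\,dz = 1.$$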




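Figure 1.29 

Evidently, 

$$\int_{\mathcal{V}} \nabla\cdot\mathbf{v}\,d\tau = 2.$$

So much for the left side of the divergence theorem. To evaluate the surface integral we must 
consider separately the six sides of the cube: 

(i) $\int \mathbf{v}\cdot d\mathbf{a} = \int_0^1\!\int_0^1 y^2\,dy\,dz = \tfrac{1}{3}$. 

(ii) $\int \mathbf{v}\cdot d\mathbf{a} = -\int_0^1\!\int_0^1 y^2\,dy\,dz = -\tfrac{1}{3}$. 

(iii) $\int \mathbf{v}\cdot d\mathbf{a} = \int_0^1\!\int_0^1 (2x + z^2)\,dx\,dz = \tfrac{4}{3}$. 

(iv) $\int \mathbf{v}\cdot d\mathbf{a} = -\int_0^1\!\int_0^1 z^2\,dx\,dz = -\tfrac{1}{3}$. 

(v) $\int \mathbf{v}\cdot d\mathbf{a} = \int_0^1\!\int_0^1 2y\,dx\,dy = 1$. 

(vi) $\int \mathbf{v}\cdot d\mathbf{a} = -\int_0^1\!\int_0^1 0\,dx\,dy = 0$. 

So the total flux is: 

$$\oint_{\mathcal{S}} \mathbf{v}\cdot d\mathbf{a} = \tfrac{1}{3} - \tfrac{1}{3} + \tfrac{4}{3} - \tfrac{1}{3} + 1 + 0 = 2,$$

as expected. 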
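
For readers who like to see the bookkeeping done by machine, here is a small sketch that repeats Example 1.10 numerically. It assumes a midpoint-rule grid (my choice, not the text's method), and it pairs opposite faces of the cube so that the outward normals supply the relative signs; the function names are hypothetical.

```python
def v(x, y, z):
    """The vector field of Example 1.10."""
    return (y**2, 2 * x * y + z**2, 2 * y * z)

def div_v(x, y, z):
    """Its divergence, 2(x + y)."""
    return 2 * (x + y)

def midpoints(n):
    """Centers of n equal cells on the interval [0, 1]."""
    return [(i + 0.5) / n for i in range(n)]

def volume_integral(n=40):
    pts = midpoints(n)
    cell = (1.0 / n) ** 3
    return sum(div_v(x, y, z) for x in pts for y in pts for z in pts) * cell

def outward_flux(n=200):
    pts = midpoints(n)
    patch = (1.0 / n) ** 2
    total = 0.0
    for p in pts:
        for q in pts:
            total += v(1.0, p, q)[0] - v(0.0, p, q)[0]   # faces x = 1 and x = 0
            total += v(p, 1.0, q)[1] - v(p, 0.0, q)[1]   # faces y = 1 and y = 0
            total += v(p, q, 1.0)[2] - v(p, q, 0.0)[2]   # faces z = 1 and z = 0
    return total * patch

left_side = volume_integral()
right_side = outward_flux()
print(left_side, right_side)
```

Both sides should land on 2, up to rounding, in agreement with the hand calculation.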







Problem 1.32 Test the divergence theorem for the function $\mathbf{v} = (xy)\,\hat{\mathbf{x}} + (2yz)\,\hat{\mathbf{y}} + (3zx)\,\hat{\mathbf{z}}$. 
Take as your volume the cube shown in Fig. 1.30, with sides of length 2. 









1.3.5 The Fundamental Theorem for Curls 



The fundamental theorem for curls, which goes by the special name of Stokes’ theorem, 
states that 

$$\int_{\mathcal{S}} (\nabla\times\mathbf{v})\cdot d\mathbf{a} = \oint_{\mathcal{P}} \mathbf{v}\cdot d\mathbf{l}. \tag{1.57}$$



As always, the integral of a derivative (here, the curl) over a region (here, a patch of surface) 
is equal to the value of the function at the boundary (here, the perimeter of the patch). As 
in the case of the divergence theorem, the boundary term is itself an integral: specifically, 
a closed line integral. 

Geometrical Interpretation: Recall that the curl measures the “twist” of the vectors v; a 
region of high curl is a whirlpool — if you put a tiny paddle wheel there, it will rotate. Now, 
the integral of the curl over some surface (or, more precisely, the flux of the curl through 
that surface) represents the “total amount of swirl,” and we can determine that swirl just as 
well by going around the edge and finding how much the flow is following the boundary 
(Fig. 1.31). You may find this a rather forced interpretation of Stokes’ theorem, but it’s a 
helpful mnemonic, if nothing else. 

You might have noticed an apparent ambiguity in Stokes’ theorem: concerning the 
boundary line integral, which way are we supposed to go around (clockwise or counter- 
clockwise)? If we go the “wrong” way we’ll pick up an overall sign error. The answer is 
that it doesn’t matter which way you go as long as you are consistent, for there is a com- 
pensating sign ambiguity in the surface integral: Which way does da point? For a closed 
surface (as in the divergence theorem) da points in the direction of the outward normal; but 
for an open surface, which way is “out?” Consistency in Stokes’ theorem (as in all such 
matters) is given by the right-hand rule: If your fingers point in the direction of the line 
integral, then your thumb fixes the direction of da (Fig. 1.32). 

Now, there are plenty of surfaces (infinitely many) that share any given boundary line. 
Twist a paper clip into a loop and dip it in soapy water. The soap film constitutes a surface, 
with the wire loop as its boundary. If you blow on it, the soap film will expand, making 
a larger surface, with the same boundary. Ordinarily, a flux integral depends critically on 
what surface you integrate over, but evidently this is not the case with curls. For Stokes’ 



Figure 1.31 



Figure 1.32 



theorem says that $\int (\nabla\times\mathbf{v})\cdot d\mathbf{a}$ is equal to the line integral of $\mathbf{v}$ around the boundary, and 
the latter makes no reference to the specific surface you choose. 

Corollary 1: $\int (\nabla\times\mathbf{v})\cdot d\mathbf{a}$ depends only on the boundary line, not 
on the particular surface used. 

Corollary 2: $\oint (\nabla\times\mathbf{v})\cdot d\mathbf{a} = 0$ for any closed surface, since the 
boundary line, like the mouth of a balloon, shrinks 
down to a point, and hence the right side of Eq. 1.57 
vanishes. 

These corollaries are analogous to those for the gradient theorem. We shall develop the 
parallel further in due course. 



Example 1.11 



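Suppose $\mathbf{v} = (2xz + 3y^2)\,\hat{\mathbf{y}} + (4yz^2)\,\hat{\mathbf{z}}$. Check Stokes’ theorem for the square surface shown 
in Fig. 1.33. 

Solution: Here 

$$\nabla\times\mathbf{v} = (4z^2 - 2x)\,\hat{\mathbf{x}} + 2z\,\hat{\mathbf{z}} \quad\text{and}\quad d\mathbf{a} = dy\,dz\,\hat{\mathbf{x}}.$$

Figure 1.33 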

(In saying that $d\mathbf{a}$ points in the x direction, we are committing ourselves to a counterclockwise 
line integral. We could as well write $d\mathbf{a} = -dy\,dz\,\hat{\mathbf{x}}$, but then we would be obliged to go 
clockwise.) Since $x = 0$ for this surface, 

$$\int (\nabla\times\mathbf{v})\cdot d\mathbf{a} = \int_0^1\!\int_0^1 4z^2\,dy\,dz = \tfrac{4}{3}.$$

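Now, what about the line integral? We must break this up into four segments: 

(i) $x = 0$, $z = 0$, $\mathbf{v}\cdot d\mathbf{l} = 3y^2\,dy$, $\int \mathbf{v}\cdot d\mathbf{l} = \int_0^1 3y^2\,dy = 1$, 

(ii) $x = 0$, $y = 1$, $\mathbf{v}\cdot d\mathbf{l} = 4z^2\,dz$, $\int \mathbf{v}\cdot d\mathbf{l} = \int_0^1 4z^2\,dz = \tfrac{4}{3}$, 

(iii) $x = 0$, $z = 1$, $\mathbf{v}\cdot d\mathbf{l} = 3y^2\,dy$, $\int \mathbf{v}\cdot d\mathbf{l} = \int_1^0 3y^2\,dy = -1$, 

(iv) $x = 0$, $y = 0$, $\mathbf{v}\cdot d\mathbf{l} = 0$, $\int \mathbf{v}\cdot d\mathbf{l} = \int_1^0 0\,dz = 0$. 

So 

$$\oint \mathbf{v}\cdot d\mathbf{l} = 1 + \tfrac{4}{3} - 1 + 0 = \tfrac{4}{3}.$$

It checks. 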

A point of strategy: notice how I handled step (iii). There is a temptation to write $d\mathbf{l} = -dy\,\hat{\mathbf{y}}$ 
here, since the path goes to the left. You can get away with this, if you insist, by running the 
integral from 0 → 1. Personally, I prefer to say $d\mathbf{l} = dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}$ always (never any 
minus signs) and let the limits of the integral take care of the direction. 
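
Example 1.11 can be replayed numerically as well. The sketch below is an illustration under my own assumptions (midpoint sums, hypothetical function names); it follows the strategy just described, in that the displacement along each segment is simply "end minus start," so the direction of travel is carried by the corner ordering rather than by hand-inserted minus signs.

```python
def v(x, y, z):
    """The vector field of Example 1.11."""
    return (0.0, 2 * x * z + 3 * y**2, 4 * y * z**2)

def curl_v_x(x, y, z):
    """x component of curl v, the only one that meets da = dy dz x_hat."""
    return 4 * z**2 - 2 * x

def flux_of_curl(n=200):
    """Midpoint-rule integral of (curl v) . da over the square x = 0, 0 <= y, z <= 1."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            y, z = (i + 0.5) * h, (j + 0.5) * h
            total += curl_v_x(0.0, y, z) * h * h
    return total

def circulation(n=2000):
    """Counterclockwise line integral of v . dl around the edge of the square."""
    corners = [(0, 0, 0), (0, 1, 0), (0, 1, 1), (0, 0, 1), (0, 0, 0)]
    total = 0.0
    for start, end in zip(corners, corners[1:]):
        step = [(e - s) / n for s, e in zip(start, end)]   # small displacement dl
        for i in range(n):
            mid = [s + (i + 0.5) * d for s, d in zip(start, step)]
            total += sum(f * d for f, d in zip(v(*mid), step))
    return total

surface_side = flux_of_curl()
line_side = circulation()
print(surface_side, line_side)
```

Both numbers should be close to 4/3.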



Problem 1.33 Test Stokes’ theorem for the function $\mathbf{v} = (xy)\,\hat{\mathbf{x}} + (2yz)\,\hat{\mathbf{y}} + (3zx)\,\hat{\mathbf{z}}$, using the 
triangular shaded area of Fig. 1.34. 

Problem 1.34 Check Corollary 1 by using the same function and boundary line as in Ex. 1.11, 
but integrating over the five sides of the cube in Fig. 1.35. The back of the cube is open. 




Figure 1.34 



Figure 1.35 

1.3.6 Integration by Parts 



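The technique known (awkwardly) as integration by parts exploits the product rule for 
derivatives: 

$$\frac{d}{dx}(fg) = f\left(\frac{dg}{dx}\right) + g\left(\frac{df}{dx}\right).$$

Integrating both sides, and invoking the fundamental theorem: 

$$\int_a^b \frac{d}{dx}(fg)\,dx = fg\,\Big|_a^b = \int_a^b f\left(\frac{dg}{dx}\right)dx + \int_a^b g\left(\frac{df}{dx}\right)dx,$$

or 

$$\int_a^b f\left(\frac{dg}{dx}\right)dx = -\int_a^b g\left(\frac{df}{dx}\right)dx + fg\,\Big|_a^b. \tag{1.58}$$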



That’s integration by parts. It pertains to the situation in which you are called upon to 
integrate the product of one function ($f$) and the derivative of another ($g$): it says you can 
transfer the derivative from $g$ to $f$, at the cost of a minus sign and a boundary term. 



Example 1.12 

Evaluate the integral 

$$\int_0^\infty x e^{-x}\,dx.$$

Solution: The exponential can be expressed as a derivative: 

$$e^{-x} = \frac{d}{dx}\left(-e^{-x}\right);$$

in this case, then, $f(x) = x$, $g(x) = -e^{-x}$, and $df/dx = 1$, so 

$$\int_0^\infty x e^{-x}\,dx = \int_0^\infty e^{-x}\,dx - x e^{-x}\,\Big|_0^\infty = -e^{-x}\,\Big|_0^\infty = 1.$$
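
A quick numerical cross-check of Example 1.12 is sketched below. It assumes a finite cutoff `L = 50` standing in for the infinite upper limit (an approximation of mine, harmless here because $e^{-50}$ is negligible) and a simple midpoint rule; the helper name `midpoint` is hypothetical.

```python
import math

def midpoint(func, a, b, n=200_000):
    """Midpoint-rule estimate of the integral of func from a to b."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

L = 50.0   # finite cutoff standing in for infinity (assumption)

# Left side of Eq. 1.58 with f = x and dg/dx = e^{-x}
lhs = midpoint(lambda x: x * math.exp(-x), 0.0, L)

# Right side: -integral of g (df/dx) plus the boundary term f g, with g = -e^{-x}
boundary = L * (-math.exp(-L)) - 0.0
rhs = midpoint(lambda x: math.exp(-x), 0.0, L) + boundary

print(lhs, rhs)
```

Both values should sit very close to 1, the result obtained above by hand.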



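We can exploit the product rules of vector calculus, together with the appropriate fundamental 
theorems, in exactly the same way. For example, integrating 

$$\nabla\cdot(f\mathbf{A}) = f(\nabla\cdot\mathbf{A}) + \mathbf{A}\cdot(\nabla f)$$

over a volume, and invoking the divergence theorem, yields 

$$\int \nabla\cdot(f\mathbf{A})\,d\tau = \int f(\nabla\cdot\mathbf{A})\,d\tau + \int \mathbf{A}\cdot(\nabla f)\,d\tau = \oint f\mathbf{A}\cdot d\mathbf{a},$$

or 

$$\int_{\mathcal{V}} f(\nabla\cdot\mathbf{A})\,d\tau = -\int_{\mathcal{V}} \mathbf{A}\cdot(\nabla f)\,d\tau + \oint_{\mathcal{S}} f\mathbf{A}\cdot d\mathbf{a}. \tag{1.59}$$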

Here again the integrand is the product of one function ($f$) and the derivative (in this case 
the divergence) of another ($\mathbf{A}$), and integration by parts licenses us to transfer the derivative 
from $\mathbf{A}$ to $f$ (where it becomes a gradient), at the cost of a minus sign and a boundary term 
(in this case a surface integral). 

You might wonder how often one is likely to encounter an integral involving the product 
of one function and the derivative of another; the answer is surprisingly often, and integration 
by parts turns out to be one of the most powerful tools in vector calculus. 



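Problem 1.35 

(a) Show that 

$$\int_{\mathcal{S}} f(\nabla\times\mathbf{A})\cdot d\mathbf{a} = \int_{\mathcal{S}} [\mathbf{A}\times(\nabla f)]\cdot d\mathbf{a} + \oint_{\mathcal{P}} f\mathbf{A}\cdot d\mathbf{l}. \tag{1.60}$$

(b) Show that 

$$\int_{\mathcal{V}} \mathbf{B}\cdot(\nabla\times\mathbf{A})\,d\tau = \int_{\mathcal{V}} \mathbf{A}\cdot(\nabla\times\mathbf{B})\,d\tau + \oint_{\mathcal{S}} (\mathbf{A}\times\mathbf{B})\cdot d\mathbf{a}. \tag{1.61}$$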



1.4 Curvilinear Coordinates 

1.4.1 Spherical Polar Coordinates 

The spherical polar coordinates $(r, \theta, \phi)$ of a point $P$ are defined in Fig. 1.36; $r$ is the 
distance from the origin (the magnitude of the position vector), $\theta$ (the angle down from the 
z axis) is called the polar angle, and $\phi$ (the angle around from the x axis) is the azimuthal 
angle. Their relation to Cartesian coordinates $(x, y, z)$ can be read from the figure: 

$$x = r\sin\theta\cos\phi, \qquad y = r\sin\theta\sin\phi, \qquad z = r\cos\theta. \tag{1.62}$$




Figure 1.36 
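
Figure 1.36 also shows three unit vectors, $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, $\hat{\boldsymbol{\phi}}$, pointing in the direction of increase 
of the corresponding coordinates. They constitute an orthogonal (mutually perpendicular) 
basis set (just like $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, $\hat{\mathbf{z}}$), and any vector $\mathbf{A}$ can be expressed in terms of them in the usual 
way: 

$$\mathbf{A} = A_r\,\hat{\mathbf{r}} + A_\theta\,\hat{\boldsymbol{\theta}} + A_\phi\,\hat{\boldsymbol{\phi}}. \tag{1.63}$$

$A_r$, $A_\theta$, and $A_\phi$ are the radial, polar, and azimuthal components of $\mathbf{A}$. In terms of the 
Cartesian unit vectors, 

$$\begin{aligned}
\hat{\mathbf{r}} &= \sin\theta\cos\phi\,\hat{\mathbf{x}} + \sin\theta\sin\phi\,\hat{\mathbf{y}} + \cos\theta\,\hat{\mathbf{z}},\\
\hat{\boldsymbol{\theta}} &= \cos\theta\cos\phi\,\hat{\mathbf{x}} + \cos\theta\sin\phi\,\hat{\mathbf{y}} - \sin\theta\,\hat{\mathbf{z}},\\
\hat{\boldsymbol{\phi}} &= -\sin\phi\,\hat{\mathbf{x}} + \cos\phi\,\hat{\mathbf{y}},
\end{aligned} \tag{1.64}$$

as you can easily check for yourself (Prob. 1.37). I have put these formulas inside the back 
cover, for easy reference. 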

But there is a poisonous snake lurking here that I’d better warn you about: $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, and $\hat{\boldsymbol{\phi}}$ 
are associated with a particular point $P$, and they change direction as $P$ moves around. For 
example, $\hat{\mathbf{r}}$ always points radially outward, but “radially outward” can be the x direction, 
the y direction, or any other direction, depending on where you are. In Fig. 1.37, $\mathbf{A} = \hat{\mathbf{y}}$ and 
$\mathbf{B} = -\hat{\mathbf{y}}$, and yet both of them would be written as $\hat{\mathbf{r}}$ in spherical coordinates. One could 
take account of this by explicitly indicating the point of reference: $\hat{\mathbf{r}}(\theta, \phi)$, $\hat{\boldsymbol{\theta}}(\theta, \phi)$, $\hat{\boldsymbol{\phi}}(\theta, \phi)$, 
but this would be cumbersome, and as long as you are alert to the problem I don’t think it 
will cause difficulties.⁴ In particular, do not naively combine the spherical components of 
vectors associated with different points (in Fig. 1.37, $\mathbf{A} + \mathbf{B} = 0$, not $2\hat{\mathbf{r}}$, and $\mathbf{A}\cdot\mathbf{B} = -1$, 
not $+1$). Beware of differentiating a vector that is expressed in spherical coordinates, since 
the unit vectors themselves are functions of position ($\partial\hat{\mathbf{r}}/\partial\theta = \hat{\boldsymbol{\theta}}$, for example). And do 
not take $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, and $\hat{\boldsymbol{\phi}}$ outside an integral, as we did with $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, and $\hat{\mathbf{z}}$ in Eq. 1.53. In general, 
if you’re uncertain about the validity of an operation, reexpress the problem in Cartesian 
coordinates, where this difficulty does not arise. 
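
The warning can be made concrete with a few lines of code. This is only an illustrative sketch: the function names are hypothetical, and I have assumed that the two vectors sit at points on the positive and negative y axis (both in the xy plane), consistent with $\mathbf{A} = \hat{\mathbf{y}}$ and $\mathbf{B} = -\hat{\mathbf{y}}$ each being "$\hat{\mathbf{r}}$" at its own location.

```python
import math

def spherical_unit_vectors(theta, phi):
    """Cartesian components of (r_hat, theta_hat, phi_hat), following Eq. 1.64."""
    st, ct = math.sin(theta), math.cos(theta)
    sp, cp = math.sin(phi), math.cos(phi)
    r_hat = (st * cp, st * sp, ct)
    theta_hat = (ct * cp, ct * sp, -st)
    phi_hat = (-sp, cp, 0.0)
    return r_hat, theta_hat, phi_hat

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

# r_hat evaluated at a point on the +y axis and at a point on the -y axis
r_hat_A, _, _ = spherical_unit_vectors(math.pi / 2, math.pi / 2)
r_hat_B, _, _ = spherical_unit_vectors(math.pi / 2, 3 * math.pi / 2)

print(r_hat_A)                  # Cartesian components of "r_hat" at the first point
print(r_hat_B)                  # Cartesian components of "r_hat" at the second point
print(dot(r_hat_A, r_hat_B))    # dot product computed safely in Cartesian components
```

Both vectors have spherical components (1, 0, 0) at their own points, yet their Cartesian components are opposite and their dot product is $-1$; doing the arithmetic in Cartesian components, as the text recommends, avoids the trap.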




Figure 1.37 



⁴ I claimed on the very first page that vectors have no location, and I’ll stand by that. The vectors themselves 
live “out there,” completely independent of our choice of coordinates. But the notation we use to represent them 
does depend on the point in question, in curvilinear coordinates. 

An infinitesimal displacement in the $\hat{\mathbf{r}}$ direction is simply $dr$ (Fig. 1.38a), just as an 
infinitesimal element of length in the x direction is $dx$: 

$$dl_r = dr. \tag{1.65}$$

On the other hand, an infinitesimal element of length in the $\hat{\boldsymbol{\theta}}$ direction (Fig. 1.38b) is not 
just $d\theta$ (that’s an angle, and it doesn’t even have the right units for a length), but rather $r\,d\theta$: 

$$dl_\theta = r\,d\theta. \tag{1.66}$$

Similarly, an infinitesimal element of length in the $\hat{\boldsymbol{\phi}}$ direction (Fig. 1.38c) is $r\sin\theta\,d\phi$: 

$$dl_\phi = r\sin\theta\,d\phi. \tag{1.67}$$

Thus, the general infinitesimal displacement $d\mathbf{l}$ is 

$$d\mathbf{l} = dr\,\hat{\mathbf{r}} + r\,d\theta\,\hat{\boldsymbol{\theta}} + r\sin\theta\,d\phi\,\hat{\boldsymbol{\phi}}. \tag{1.68}$$

This plays the role (in line integrals, for example) that $d\mathbf{l} = dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}$ played in 
Cartesian coordinates. 



Figure 1.38 

The infinitesimal volume element $d\tau$, in spherical coordinates, is the product of the 
three infinitesimal displacements: 

$$d\tau = dl_r\,dl_\theta\,dl_\phi = r^2\sin\theta\,dr\,d\theta\,d\phi. \tag{1.69}$$

I cannot give you a general expression for surface elements $d\mathbf{a}$, since these depend on the 
orientation of the surface. You simply have to analyze the geometry for any given case (this 
goes for Cartesian and curvilinear coordinates alike). If you are integrating over the surface 
of a sphere, for instance, then $r$ is constant, whereas $\theta$ and $\phi$ change (Fig. 1.39), so 

$$d\mathbf{a}_1 = dl_\theta\,dl_\phi\,\hat{\mathbf{r}} = r^2\sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}}.$$

On the other hand, if the surface lies in the xy plane, say, so that $\theta$ is constant (to wit: $\pi/2$) 
while $r$ and $\phi$ vary, then 

$$d\mathbf{a}_2 = dl_r\,dl_\phi\,\hat{\boldsymbol{\theta}} = r\,dr\,d\phi\,\hat{\boldsymbol{\theta}}.$$




Figure 1.39 

Notice, finally, that $r$ ranges from 0 to $\infty$, $\phi$ from 0 to $2\pi$, and $\theta$ from 0 to $\pi$ (not 
$2\pi$; that would count every point twice).⁵ 

Example 1.13 



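Find the volume of a sphere of radius $R$. 

Solution: 

$$V = \int d\tau = \int_{r=0}^{R}\int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi} r^2\sin\theta\,dr\,d\theta\,d\phi = \left(\int_0^R r^2\,dr\right)\left(\int_0^\pi \sin\theta\,d\theta\right)\left(\int_0^{2\pi} d\phi\right) = \left(\frac{R^3}{3}\right)(2)(2\pi) = \frac{4}{3}\pi R^3.$$

(Not a big surprise.) 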
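
As a sanity check on the volume element of Eq. 1.69, one can add up $r^2\sin\theta\,dr\,d\theta\,d\phi$ over a coarse grid. The sketch below assumes a midpoint rule with 50 cells per coordinate and an illustrative radius $R = 2$ (both arbitrary choices of mine); the function name is hypothetical.

```python
import math

def sphere_volume(R, n=50):
    """Sum the volume element r^2 sin(theta) dr dtheta dphi over the sphere."""
    d_r, d_theta, d_phi = R / n, math.pi / n, 2 * math.pi / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * d_r                       # r runs from 0 to R
        for j in range(n):
            theta = (j + 0.5) * d_theta           # theta runs from 0 to pi (not 2 pi)
            slab = r**2 * math.sin(theta) * d_r * d_theta
            for _ in range(n):                    # phi runs from 0 to 2 pi
                total += slab * d_phi
    return total

R = 2.0                                   # illustrative radius (assumption)
numeric = sphere_volume(R)
exact = 4.0 / 3.0 * math.pi * R**3
print(numeric, exact)
```

The two values should agree to a fraction of a percent at this resolution, and better as `n` increases.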




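So far we have talked only about the geometry of spherical coordinates. Now I would 
like to “translate” the vector derivatives (gradient, divergence, curl, and Laplacian) into $r$, 
$\theta$, $\phi$ notation. In principle this is entirely straightforward: in the case of the gradient, 

$$\nabla T = \frac{\partial T}{\partial x}\,\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\,\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\,\hat{\mathbf{z}},$$

for instance, we would first use the chain rule to reexpress the partials: 

$$\frac{\partial T}{\partial x} = \frac{\partial T}{\partial r}\left(\frac{\partial r}{\partial x}\right) + \frac{\partial T}{\partial\theta}\left(\frac{\partial\theta}{\partial x}\right) + \frac{\partial T}{\partial\phi}\left(\frac{\partial\phi}{\partial x}\right).$$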

⁵ Alternatively, you could run $\phi$ from 0 to $\pi$ (the “eastern hemisphere”) and cover the “western hemisphere” by 
extending $\theta$ from $\pi$ up to $2\pi$. But this is very bad notation, since, among other things, $\sin\theta$ will then run negative, 
and you’ll have to put absolute value signs around that term in volume and surface elements (area and volume 
being intrinsically positive quantities). 

The terms in parentheses could be worked out from Eq. 1.62, or rather, the inverse of 
those equations (Prob. 1.36). Then we’d do the same for $\partial T/\partial y$ and $\partial T/\partial z$. Finally, we’d 
substitute in the formulas for $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, and $\hat{\mathbf{z}}$ in terms of $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, and $\hat{\boldsymbol{\phi}}$ (Prob. 1.37). It would take 
an hour to figure out the gradient in spherical coordinates by this brute-force method. I 
suppose this is how it was first done, but there is a much more efficient indirect approach, 
explained in Appendix A, which has the extra advantage of treating all coordinate systems 
at once. I described the “straightforward” method only to show you that there is nothing 
subtle or mysterious about transforming to spherical coordinates: you’re expressing the 
same quantity (gradient, divergence, or whatever) in different notation, that’s all. 

Here, then, are the vector derivatives in spherical coordinates: 



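Gradient: 

$$\nabla T = \frac{\partial T}{\partial r}\,\hat{\mathbf{r}} + \frac{1}{r}\frac{\partial T}{\partial\theta}\,\hat{\boldsymbol{\theta}} + \frac{1}{r\sin\theta}\frac{\partial T}{\partial\phi}\,\hat{\boldsymbol{\phi}}. \tag{1.70}$$

Divergence: 

$$\nabla\cdot\mathbf{v} = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2 v_r\right) + \frac{1}{r\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,v_\theta\right) + \frac{1}{r\sin\theta}\frac{\partial v_\phi}{\partial\phi}. \tag{1.71}$$

Curl: 

$$\nabla\times\mathbf{v} = \frac{1}{r\sin\theta}\left[\frac{\partial}{\partial\theta}\left(\sin\theta\,v_\phi\right) - \frac{\partial v_\theta}{\partial\phi}\right]\hat{\mathbf{r}} + \frac{1}{r}\left[\frac{1}{\sin\theta}\frac{\partial v_r}{\partial\phi} - \frac{\partial}{\partial r}\left(r v_\phi\right)\right]\hat{\boldsymbol{\theta}} + \frac{1}{r}\left[\frac{\partial}{\partial r}\left(r v_\theta\right) - \frac{\partial v_r}{\partial\theta}\right]\hat{\boldsymbol{\phi}}. \tag{1.72}$$

Laplacian: 

$$\nabla^2 T = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial T}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial T}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2 T}{\partial\phi^2}. \tag{1.73}$$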



For reference, these formulas are listed inside the front cover. 



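Problem 1.36 Find formulas for $r$, $\theta$, $\phi$ in terms of $x$, $y$, $z$ (the inverse, in other words, of 
Eq. 1.62). 

• Problem 1.37 Express the unit vectors $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, $\hat{\boldsymbol{\phi}}$ in terms of $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, $\hat{\mathbf{z}}$ (that is, derive Eq. 1.64). 

Check your answers several ways ($\hat{\mathbf{r}}\cdot\hat{\mathbf{r}} = 1$, $\hat{\boldsymbol{\theta}}\cdot\hat{\boldsymbol{\phi}} = 0$, $\hat{\mathbf{r}}\times\hat{\boldsymbol{\theta}} = \hat{\boldsymbol{\phi}}$, . . .). Also work out the 
inverse formulas, giving $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, $\hat{\mathbf{z}}$ in terms of $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$, $\hat{\boldsymbol{\phi}}$ (and $\theta$, $\phi$). 

• Problem 1.38 

(a) Check the divergence theorem for the function $\mathbf{v}_1 = r^2\,\hat{\mathbf{r}}$, using as your volume the sphere 
of radius $R$, centered at the origin. 

(b) Do the same for $\mathbf{v}_2 = (1/r^2)\,\hat{\mathbf{r}}$. (If the answer surprises you, look back at Prob. 1.16.) 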

Figure 1.40 Figure 1.41 



Problem 1.39 Compute the divergence of the function 

$$\mathbf{v} = (r\cos\theta)\,\hat{\mathbf{r}} + (r\sin\theta)\,\hat{\boldsymbol{\theta}} + (r\sin\theta\cos\phi)\,\hat{\boldsymbol{\phi}}.$$

Check the divergence theorem for this function, using as your volume the inverted hemispherical 
bowl of radius $R$, resting on the xy plane and centered at the origin (Fig. 1.40). 

Problem 1.40 Compute the gradient and Laplacian of the function $T = r(\cos\theta + \sin\theta\cos\phi)$. 
Check the Laplacian by converting $T$ to Cartesian coordinates and using Eq. 1.42. Test the 
gradient theorem for this function, using the path shown in Fig. 1.41, from (0, 0, 0) to (0, 0, 2). 



1.4.2 Cylindrical Coordinates 

The cylindrical coordinates $(s, \phi, z)$ of a point $P$ are defined in Fig. 1.42. Notice that $\phi$ 
has the same meaning as in spherical coordinates, and $z$ is the same as Cartesian; $s$ is the 
distance to $P$ from the z axis, whereas the spherical coordinate $r$ is the distance from the 
origin. The relation to Cartesian coordinates is 

$$x = s\cos\phi, \qquad y = s\sin\phi, \qquad z = z. \tag{1.74}$$

The unit vectors (Prob. 1.41) are 

$$\begin{aligned}
\hat{\mathbf{s}} &= \cos\phi\,\hat{\mathbf{x}} + \sin\phi\,\hat{\mathbf{y}},\\
\hat{\boldsymbol{\phi}} &= -\sin\phi\,\hat{\mathbf{x}} + \cos\phi\,\hat{\mathbf{y}},\\
\hat{\mathbf{z}} &= \hat{\mathbf{z}}.
\end{aligned} \tag{1.75}$$

The infinitesimal displacements are 

$$dl_s = ds, \qquad dl_\phi = s\,d\phi, \qquad dl_z = dz, \tag{1.76}$$
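
so 

$$d\mathbf{l} = ds\,\hat{\mathbf{s}} + s\,d\phi\,\hat{\boldsymbol{\phi}} + dz\,\hat{\mathbf{z}}, \tag{1.77}$$

and the volume element is 

$$d\tau = s\,ds\,d\phi\,dz. \tag{1.78}$$

The range of $s$ is 0 → $\infty$, $\phi$ goes from 0 → $2\pi$, and $z$ from $-\infty$ to $\infty$. 
The vector derivatives in cylindrical coordinates are: 

Gradient: 

$$\nabla T = \frac{\partial T}{\partial s}\,\hat{\mathbf{s}} + \frac{1}{s}\frac{\partial T}{\partial\phi}\,\hat{\boldsymbol{\phi}} + \frac{\partial T}{\partial z}\,\hat{\mathbf{z}}. \tag{1.79}$$

Divergence: 

$$\nabla\cdot\mathbf{v} = \frac{1}{s}\frac{\partial}{\partial s}\left(s v_s\right) + \frac{1}{s}\frac{\partial v_\phi}{\partial\phi} + \frac{\partial v_z}{\partial z}. \tag{1.80}$$

Curl: 

$$\nabla\times\mathbf{v} = \left(\frac{1}{s}\frac{\partial v_z}{\partial\phi} - \frac{\partial v_\phi}{\partial z}\right)\hat{\mathbf{s}} + \left(\frac{\partial v_s}{\partial z} - \frac{\partial v_z}{\partial s}\right)\hat{\boldsymbol{\phi}} + \frac{1}{s}\left[\frac{\partial}{\partial s}\left(s v_\phi\right) - \frac{\partial v_s}{\partial\phi}\right]\hat{\mathbf{z}}. \tag{1.81}$$

Laplacian: 

$$\nabla^2 T = \frac{1}{s}\frac{\partial}{\partial s}\left(s\,\frac{\partial T}{\partial s}\right) + \frac{1}{s^2}\frac{\partial^2 T}{\partial\phi^2} + \frac{\partial^2 T}{\partial z^2}. \tag{1.82}$$

These formulas are also listed inside the front cover. 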



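Problem 1.41 Express the cylindrical unit vectors $\hat{\mathbf{s}}$, $\hat{\boldsymbol{\phi}}$, $\hat{\mathbf{z}}$ in terms of $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, $\hat{\mathbf{z}}$ (that is, derive 
Eq. 1.75). “Invert” your formulas to get $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, $\hat{\mathbf{z}}$ in terms of $\hat{\mathbf{s}}$, $\hat{\boldsymbol{\phi}}$, $\hat{\mathbf{z}}$ (and $\phi$). 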

Figure 1.43 



Figure 1.44 



Problem 1.42 

(a) Find the divergence of the function 

$$\mathbf{v} = s(2 + \sin^2\phi)\,\hat{\mathbf{s}} + s\sin\phi\cos\phi\,\hat{\boldsymbol{\phi}} + 3z\,\hat{\mathbf{z}}.$$

(b) Test the divergence theorem for this function, using the quarter-cylinder (radius 2, height 
5) shown in Fig. 1.43. 

(c) Find the curl of v. 



1.5 The Dirac Delta Function 



1.5.1 The Divergence of $\hat{\mathbf{r}}/r^2$ 



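Consider the vector function 

$$\mathbf{v} = \frac{1}{r^2}\,\hat{\mathbf{r}}. \tag{1.83}$$

At every location, $\mathbf{v}$ is directed radially outward (Fig. 1.44); if ever there was a function that 
ought to have a large positive divergence, this is it. And yet, when you actually calculate 
the divergence (using Eq. 1.71), you get precisely zero: 

$$\nabla\cdot\mathbf{v} = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\,\frac{1}{r^2}\right) = \frac{1}{r^2}\frac{\partial}{\partial r}(1) = 0. \tag{1.84}$$

(You will have encountered this paradox already, if you worked Prob. 1.16.) The plot 
thickens if you apply the divergence theorem to this function. Suppose we integrate over a 
sphere of radius $R$, centered at the origin (Prob. 1.38b); the surface integral is 

$$\oint \mathbf{v}\cdot d\mathbf{a} = \int \left(\frac{1}{R^2}\,\hat{\mathbf{r}}\right)\cdot\left(R^2\sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}}\right) = \left(\int_0^\pi \sin\theta\,d\theta\right)\left(\int_0^{2\pi} d\phi\right) = 4\pi. \tag{1.85}$$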

But the volume integral, $\int \nabla\cdot\mathbf{v}\,d\tau$, is zero, if we are really to believe Eq. 1.84. Does this 
mean that the divergence theorem is false? What’s going on here? 

The source of the problem is the point $r = 0$, where $\mathbf{v}$ blows up (and where, in Eq. 1.84, 
we have unwittingly divided by zero). It is quite true that $\nabla\cdot\mathbf{v} = 0$ everywhere except 
the origin, but right at the origin the situation is more complicated. Notice that the surface 
integral (1.85) is independent of $R$; if the divergence theorem is right (and it is), we should 
get $\int (\nabla\cdot\mathbf{v})\,d\tau = 4\pi$ for any sphere centered at the origin, no matter how small. Evidently 
the entire contribution must be coming from the point $r = 0$! Thus, $\nabla\cdot\mathbf{v}$ has the bizarre 
property that it vanishes everywhere except at one point, and yet its integral (over any 
volume containing that point) is $4\pi$. No ordinary function behaves like that. (On the other 
hand, a physical example does come to mind: the density (mass per unit volume) of a point 
particle. It’s zero except at the exact location of the particle, and yet its integral is finite: 
namely, the mass of the particle.) What we have stumbled on is a mathematical object 
known to physicists as the Dirac delta function. It arises in many branches of theoretical 
physics. Moreover, the specific problem at hand (the divergence of the function $\hat{\mathbf{r}}/r^2$) is not 
just some arcane curiosity; it is, in fact, central to the whole theory of electrodynamics. 
So it is worthwhile to pause here and study the Dirac delta function with some care. 
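
Both halves of the paradox show up in a numerical experiment. The sketch below is illustrative only: it assumes a central-difference estimate of the divergence at one arbitrarily chosen point away from the origin, and a midpoint-rule surface integral over spheres of three arbitrarily chosen radii; all function names are hypothetical.

```python
import math

def v(x, y, z):
    """Cartesian components of r_hat / r^2, i.e. (x, y, z) / r^3."""
    r3 = (x * x + y * y + z * z) ** 1.5
    return (x / r3, y / r3, z / r3)

def divergence(x, y, z, h=1e-4):
    """Central-difference estimate of div v at a point away from the origin."""
    dvx = (v(x + h, y, z)[0] - v(x - h, y, z)[0]) / (2 * h)
    dvy = (v(x, y + h, z)[1] - v(x, y - h, z)[1]) / (2 * h)
    dvz = (v(x, y, z + h)[2] - v(x, y, z - h)[2]) / (2 * h)
    return dvx + dvy + dvz

def flux_through_sphere(R, n=200):
    """Midpoint-rule surface integral of v . da over a sphere of radius R."""
    d_theta, d_phi = math.pi / n, 2 * math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * d_theta
        for j in range(n):
            phi = (j + 0.5) * d_phi
            x = R * math.sin(theta) * math.cos(phi)
            y = R * math.sin(theta) * math.sin(phi)
            z = R * math.cos(theta)
            vx, vy, vz = v(x, y, z)
            v_dot_r_hat = (vx * x + vy * y + vz * z) / R
            total += v_dot_r_hat * R**2 * math.sin(theta) * d_theta * d_phi
    return total

div_sample = divergence(0.3, -0.4, 0.5)                           # should be near zero
fluxes = {R: flux_through_sphere(R) for R in (0.01, 1.0, 100.0)}  # each should be near 4 pi
print(div_sample, fluxes, 4 * math.pi)
```

The divergence estimate is zero to within finite-difference error, while the flux stays near $4\pi$ whether the sphere is tiny or enormous, which is exactly the behavior the text attributes to a contribution concentrated at the origin.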



1.5.2 The One-Dimensional Dirac Delta Function 

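The one-dimensional Dirac delta function, $\delta(x)$, can be pictured as an infinitely high, 
infinitesimally narrow “spike,” with area 1 (Fig. 1.45). That is to say: 

$$\delta(x) = \begin{cases} 0, & \text{if } x \neq 0 \\ \infty, & \text{if } x = 0 \end{cases} \tag{1.86}$$

and 

$$\int_{-\infty}^{\infty} \delta(x)\,dx = 1. \tag{1.87}$$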



Technically, $\delta(x)$ is not a function at all, since its value is not finite at $x = 0$. In the 
mathematical literature it is known as a generalized function, or distribution. It is, if you 
like, the limit of a sequence of functions, such as rectangles $R_n(x)$, of height $n$ and width 
$1/n$, or isosceles triangles $T_n(x)$, of height $n$ and base $2/n$ (Fig. 1.46). 

Figure 1.45 

Figure 1.46 

If $f(x)$ is some “ordinary” function (that is, not another delta function; in fact, just 
to be on the safe side let’s say that $f(x)$ is continuous), then the product $f(x)\delta(x)$ is zero 
everywhere except at $x = 0$. It follows that 

$$f(x)\delta(x) = f(0)\delta(x). \tag{1.88}$$

(This is the most important fact about the delta function, so make sure you understand why 
it is true: since the product is zero anyway except at $x = 0$, we may as well replace $f(x)$ 
by the value it assumes at the origin.) In particular 

$$\int_{-\infty}^{\infty} f(x)\delta(x)\,dx = f(0)\int_{-\infty}^{\infty} \delta(x)\,dx = f(0). \tag{1.89}$$

Under an integral, then, the delta function “picks out” the value of $f(x)$ at $x = 0$. (Here 
and below, the integral need not run from $-\infty$ to $+\infty$; it is sufficient that the domain extend 
across the delta function, and $-\epsilon$ to $+\epsilon$ would do as well.) 
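
The "picking out" property can be watched emerging from the rectangle sequence $R_n(x)$ mentioned earlier. This is a minimal sketch under my own assumptions: the rectangles are taken to be centered on the origin, the test function $f(x) = \cos x + x$ is an arbitrary illustrative choice with $f(0) = 1$, and the helper names are hypothetical.

```python
import math

def rectangle(n, x):
    """R_n(x): height n for |x| < 1/(2n), zero elsewhere, so the area is 1."""
    return float(n) if abs(x) < 0.5 / n else 0.0

def integrate_against_rectangle(f, n, samples=10_000):
    """Midpoint-rule integral of f(x) R_n(x) over the support of R_n."""
    width = 1.0 / n
    h = width / samples
    total = 0.0
    for i in range(samples):
        x = -0.5 * width + (i + 0.5) * h
        total += f(x) * rectangle(n, x) * h
    return total

f = lambda x: math.cos(x) + x    # illustrative test function, f(0) = 1 (assumption)

results = {n: integrate_against_rectangle(f, n) for n in (1, 10, 100, 1000)}
for n, value in results.items():
    print(n, value)
```

As $n$ grows the integral should close in on $f(0) = 1$, which is Eq. 1.89 seen as a limit.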

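Of course, we can shift the spike from $x = 0$ to some other point, $x = a$ (Fig. 1.47): 

$$\delta(x - a) = \begin{cases} 0, & \text{if } x \neq a \\ \infty, & \text{if } x = a \end{cases} \quad\text{with}\quad \int_{-\infty}^{\infty} \delta(x - a)\,dx = 1. \tag{1.90}$$

Equation 1.88 becomes 

$$f(x)\delta(x - a) = f(a)\delta(x - a), \tag{1.91}$$

and Eq. 1.89 generalizes to 

$$\int_{-\infty}^{\infty} f(x)\delta(x - a)\,dx = f(a). \tag{1.92}$$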



Example 1.14 



Evaluate the integral 

$$\int_0^3 x^3\,\delta(x - 2)\,dx.$$

Figure 1.47 

Solution: The delta function picks out the value of $x^3$ at the point $x = 2$, so the integral is 
$2^3 = 8$. Notice, however, that if the upper limit had been 1 (instead of 3) the answer would be 
0, because the spike would then be outside the domain of integration. 



Although $\delta$ itself is not a legitimate function, integrals over $\delta$ are perfectly acceptable. 
In fact, it’s best to think of the delta function as something that is always intended for use 
under an integral sign. In particular, two expressions involving delta functions (say, $D_1(x)$ 
and $D_2(x)$) are considered equal if⁶ 

$$\int_{-\infty}^{\infty} f(x)D_1(x)\,dx = \int_{-\infty}^{\infty} f(x)D_2(x)\,dx, \tag{1.93}$$

for all (“ordinary”) functions $f(x)$. 

Example 1.15 

Show that 

$$\delta(kx) = \frac{1}{|k|}\,\delta(x), \tag{1.94}$$

where $k$ is any (nonzero) constant. (In particular, $\delta(-x) = \delta(x)$.) 

Solution: For an arbitrary test function $f(x)$, consider the integral 

$$\int_{-\infty}^{\infty} f(x)\delta(kx)\,dx.$$

Changing variables, we let $y = kx$, so that $x = y/k$, and $dx = (1/k)\,dy$. If $k$ is positive, the 
integration still runs from $-\infty$ to $+\infty$, but if $k$ is negative, then $x = \infty$ implies $y = -\infty$, and 

⁶ This is not as arbitrary as it may sound. The crucial point is that the integrals must be equal for any $f(x)$. 
Suppose $D_1(x)$ and $D_2(x)$ actually differed, say, in the neighborhood of the point $x = 17$. Then we could pick a 
function $f(x)$ that was sharply peaked about $x = 17$, and the integrals would not be equal. 

vice versa, so the order of the limits is reversed. Restoring the “proper” order costs a minus 
sign. Thus 

$$\int_{-\infty}^{\infty} f(x)\delta(kx)\,dx = \pm\int_{-\infty}^{\infty} f(y/k)\,\delta(y)\,\frac{dy}{k} = \pm\frac{1}{k}f(0) = \frac{1}{|k|}f(0).$$

(The lower signs apply when $k$ is negative, and we account for this neatly by putting absolute 
value bars around the final $k$, as indicated.) Under the integral sign, then, $\delta(kx)$ serves the 
same purpose as $(1/|k|)\delta(x)$: 

$$\int_{-\infty}^{\infty} f(x)\delta(kx)\,dx = \int_{-\infty}^{\infty} f(x)\left[\frac{1}{|k|}\delta(x)\right]dx.$$

According to criterion 1.93, therefore, $\delta(kx)$ and $(1/|k|)\delta(x)$ are equal. 
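
Equation 1.94 can be probed numerically by letting a very narrow, unit-area Gaussian stand in for the delta function. That substitution is my assumption for the sake of illustration (the text itself uses rectangles and triangles), as are the width `sigma = 0.01`, the test function, the value $k = -4$, and the helper names.

```python
import math

def narrow_gaussian(x, sigma=0.01):
    """Unit-area Gaussian of width sigma, standing in for delta(x) (assumption)."""
    return math.exp(-0.5 * (x / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def integrate(func, a, b, n=100_000):
    """Midpoint-rule estimate of the integral of func from a to b."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

f = lambda x: math.cos(x) + 2.0   # illustrative test function, f(0) = 3
k = -4.0                          # a negative k exercises the absolute value

lhs = integrate(lambda x: f(x) * narrow_gaussian(k * x), -1.0, 1.0)   # f against "delta(kx)"
rhs = f(0.0) / abs(k)                                                 # f(0) / |k|
print(lhs, rhs)
```

The two numbers should nearly coincide, and the agreement improves as `sigma` is made smaller; note that the sign of $k$ drops out, as the absolute value bars promise.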



Problem 1.43 Evaluate the following integrals: 

(a) $\int (3x^2 - 2x - 1)\,\delta(x - 3)\,dx$. 

(b) $\int_0^5 \cos x\,\delta(x - \pi)\,dx$. 

(c) $\int_0^3 x^3\,\delta(x + 1)\,dx$. 

(d) $\int_{-\infty}^{\infty} \ln(x + 3)\,\delta(x + 2)\,dx$. 

Problem 1.44 Evaluate the following integrals: 

(a) $\int_{-2}^{2} (2x + 3)\,\delta(3x)\,dx$. 

(b) $\int_0^2 (x^3 + 3x + 2)\,\delta(1 - x)\,dx$. 

(c) $\int 9x^2\,\delta(3x + 1)\,dx$. 

(d) $\int_{-\infty}^{a} \delta(x - b)\,dx$. 

Problem 1.45 

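(a) Show that 

$$x\,\frac{d}{dx}\bigl(\delta(x)\bigr) = -\delta(x).$$

[Hint: Use integration by parts.] 

(b) Let $\theta(x)$ be the step function: 

$$\theta(x) = \begin{cases} 1, & \text{if } x > 0 \\ 0, & \text{if } x \le 0 \end{cases} \tag{1.95}$$

Show that $d\theta/dx = \delta(x)$. 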

1.5.3 The Three-Dimensional Delta Function 

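It is an easy matter to generalize the delta function to three dimensions: 

$$\delta^3(\mathbf{r}) = \delta(x)\,\delta(y)\,\delta(z). \tag{1.96}$$

(As always, $\mathbf{r} = x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}$ is the position vector, extending from the origin to the point 
$(x, y, z)$.) This three-dimensional delta function is zero everywhere except at (0, 0, 0), 
where it blows up. Its volume integral is 1: 

$$\int_{\text{all space}} \delta^3(\mathbf{r})\,d\tau = \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty} \delta(x)\,\delta(y)\,\delta(z)\,dx\,dy\,dz = 1. \tag{1.97}$$

And, generalizing Eq. 1.92, 

$$\int_{\text{all space}} f(\mathbf{r})\,\delta^3(\mathbf{r} - \mathbf{a})\,d\tau = f(\mathbf{a}). \tag{1.98}$$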



As in the one-dimensional case, integration with $\delta$ picks out the value of the function $f$ at 
the location of the spike. 

We are now in a position to resolve the paradox introduced in Sect. 1.5.1. As you will 
recall, we found that the divergence of $\hat{\mathbf{r}}/r^2$ is zero everywhere except at the origin, and 
yet its integral over any volume containing the origin is a constant (to wit: $4\pi$). These are 
precisely the defining conditions for the Dirac delta function; evidently 



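$$\nabla\cdot\left(\frac{\hat{\mathbf{r}}}{r^2}\right) = 4\pi\,\delta^3(\mathbf{r}). \tag{1.99}$$

More generally, 

$$\nabla\cdot\left(\frac{\hat{𝓻}}{𝓇^2}\right) = 4\pi\,\delta^3(𝓻), \tag{1.100}$$

where, as always, $𝓻$ is the separation vector: $𝓻 = \mathbf{r} - \mathbf{r}'$. Note that differentiation here is 
with respect to $\mathbf{r}$, while $\mathbf{r}'$ is held constant. Incidentally, since 

$$\nabla\left(\frac{1}{𝓇}\right) = -\frac{\hat{𝓻}}{𝓇^2} \tag{1.101}$$

(Prob. 1.13), it follows that 

$$\nabla^2\,\frac{1}{𝓇} = -4\pi\,\delta^3(𝓻). \tag{1.102}$$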

Example 1.16

Evaluate the integral

$$J = \int_{\mathcal{V}} (r^2 + 2)\,\nabla \cdot \left(\frac{\hat{\mathbf{r}}}{r^2}\right) d\tau,$$

where $\mathcal{V}$ is a sphere of radius R centered at the origin.

Solution 1: Use Eq. 1.99 to rewrite the divergence, and Eq. 1.98 to do the integral:

$$J = \int_{\mathcal{V}} (r^2 + 2)\,4\pi\,\delta^3(\mathbf{r})\,d\tau = 4\pi(0 + 2) = 8\pi.$$

This one-line solution demonstrates something of the power and beauty of the delta function,
but I would like to show you a second method, which is much more cumbersome but serves to
illustrate the method of integration by parts, Sect. 1.3.6.

Solution 2: Using Eq. 1.59, we transfer the derivative from $\hat{\mathbf{r}}/r^2$ to $(r^2 + 2)$:

$$J = -\int_{\mathcal{V}} \frac{\hat{\mathbf{r}}}{r^2} \cdot [\nabla(r^2 + 2)]\,d\tau + \oint_{\mathcal{S}} (r^2 + 2)\,\frac{\hat{\mathbf{r}}}{r^2} \cdot d\mathbf{a}.$$

The gradient is

$$\nabla(r^2 + 2) = 2r\,\hat{\mathbf{r}},$$

so the volume integral becomes

$$\int \frac{2}{r}\,d\tau = \int \frac{2}{r}\,r^2 \sin\theta\,dr\,d\theta\,d\phi = 8\pi \int_0^R r\,dr = 4\pi R^2.$$

Meanwhile, on the boundary of the sphere (where r = R),

$$d\mathbf{a} = R^2 \sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}},$$

so the surface integral becomes

$$\int (R^2 + 2) \sin\theta\,d\theta\,d\phi = 4\pi(R^2 + 2).$$

Putting it all together, then,

$$J = -4\pi R^2 + 4\pi(R^2 + 2) = 8\pi,$$

as before.
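
The bookkeeping in Solution 2 can be checked numerically. The following is a minimal sketch
(the function name `j_by_parts` and the midpoint-rule discretization are choices made for this
illustration): it adds up the volume term shell by shell, evaluates the surface term on
r = R, and combines them with the signs used above. The radius R should drop out.

```python
import math


def j_by_parts(R, n=1000):
    """Solution 2 of Ex. 1.16: J = -(volume term) + (surface term)."""
    dr = R / n
    volume_term = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        # (r_hat / r^2) . grad(r^2 + 2) = 2 / r, times the shell volume 4 pi r^2 dr
        volume_term += (2.0 / r) * 4.0 * math.pi * r**2 * dr
    # on the sphere r = R: (R^2 + 2) / R^2 times the area 4 pi R^2
    surface_term = (R**2 + 2.0) / R**2 * 4.0 * math.pi * R**2
    return -volume_term + surface_term


if __name__ == "__main__":
    for R in (0.5, 1.0, 3.0):
        print(R, j_by_parts(R), 8.0 * math.pi)
```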

Problem 1.46

(a) Write an expression for the electric charge density $\rho(\mathbf{r})$ of a point charge q at $\mathbf{r}'$. Make
sure that the volume integral of $\rho$ equals q.

(b) What is the charge density of an electric dipole, consisting of a point charge $-q$ at the
origin and a point charge $+q$ at $\mathbf{a}$?

(c) What is the charge density of a uniform, infinitesimally thin spherical shell of radius R and
total charge Q, centered at the origin? [Beware: the integral over all space must equal Q.]

Problem 1.47 Evaluate the following integrals:

(a) $\int_{\text{all space}} (r^2 + \mathbf{r} \cdot \mathbf{a} + a^2)\,\delta^3(\mathbf{r} - \mathbf{a})\,d\tau$, where $\mathbf{a}$ is a fixed vector and $a$ is its magnitude.

(b) $\int_{\mathcal{V}} |\mathbf{r} - \mathbf{b}|^2\,\delta^3(5\mathbf{r})\,d\tau$, where $\mathcal{V}$ is a cube of side 2, centered on the origin, and $\mathbf{b} = 4\,\hat{\mathbf{y}} + 3\,\hat{\mathbf{z}}$.

(c) $\int_{\mathcal{V}} (r^4 + r^2(\mathbf{r} \cdot \mathbf{c}) + c^4)\,\delta^3(\mathbf{r} - \mathbf{c})\,d\tau$, where $\mathcal{V}$ is a sphere of radius 6 about the origin,
$\mathbf{c} = 5\,\hat{\mathbf{x}} + 3\,\hat{\mathbf{y}} + 2\,\hat{\mathbf{z}}$, and $c$ is its magnitude.

(d) $\int_{\mathcal{V}} \mathbf{r} \cdot (\mathbf{d} - \mathbf{r})\,\delta^3(\mathbf{e} - \mathbf{r})\,d\tau$, where $\mathbf{d} = (1, 2, 3)$, $\mathbf{e} = (3, 2, 1)$, and $\mathcal{V}$ is a sphere of radius
1.5 centered at (2, 2, 2).

Problem 1.48 Evaluate the integral 




(where V is a sphere of radius R, centered at the origin) by two different methods, as in Ex. 1.16. 



1.6 The Theory of Vector Fields 

1.6.1 The Helmholtz Theorem 

Ever since Faraday, the laws of electricity and magnetism have been expressed in terms of 
electric and magnetic fields, E and B. Like many physical laws, these are most compactly 
expressed as differential equations. Since E and B are vectors, the differential equations 
naturally involve vector derivatives: divergence and curl. Indeed, Maxwell reduced the 
entire theory to four equations, specifying respectively the divergence and the curl of E and 
B.$^7$

$^7$ Strictly speaking, this is only true in the static case; in general, the divergence and curl are given in terms of
time derivatives of the fields themselves.

Maxwell’s formulation raises an important mathematical question: To what extent is a
vector function determined by its divergence and curl? In other words, if I tell you that the
divergence of F (which stands for E or B, as the case may be) is a specified (scalar) function
D,

$$\nabla \cdot \mathbf{F} = D,$$

and the curl of F is a specified (vector) function C,

$$\nabla \times \mathbf{F} = \mathbf{C},$$

(for consistency, C must be divergenceless,

$$\nabla \cdot \mathbf{C} = 0,$$

because the divergence of a curl is always zero), can you then determine the function F?

Well... not quite. For example, as you may have discovered in Prob. 1.19, there are
many functions whose divergence and curl are both zero everywhere: the trivial case $\mathbf{F} = 0$,
of course, but also $\mathbf{F} = yz\,\hat{\mathbf{x}} + zx\,\hat{\mathbf{y}} + xy\,\hat{\mathbf{z}}$, $\mathbf{F} = \sin x \cosh y\,\hat{\mathbf{x}} - \cos x \sinh y\,\hat{\mathbf{y}}$, etc. To solve
a differential equation you must also be supplied with appropriate boundary conditions.
In electrodynamics we typically require that the fields go to zero “at infinity” (far away
from all charges).$^8$ With that extra information the Helmholtz theorem guarantees that the
field is uniquely determined by its divergence and curl. (A proof of the Helmholtz theorem
is given in Appendix B.)
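
The claim that the two sample functions have vanishing divergence and curl is easy to spot-check
numerically. The sketch below is an illustration under stated assumptions: the helper names
(`divergence`, `curl`, `F_a`, `F_b`), the step size `h`, and the use of central differences are
all choices made here, not anything prescribed by the text.

```python
import math


def _partial(F, p, i, k, h):
    """Central-difference estimate of d F_i / d x_k at the point p."""
    plus, minus = list(p), list(p)
    plus[k] += h
    minus[k] -= h
    return (F(*plus)[i] - F(*minus)[i]) / (2.0 * h)


def divergence(F, p, h=1e-4):
    return sum(_partial(F, p, k, k, h) for k in range(3))


def curl(F, p, h=1e-4):
    return (
        _partial(F, p, 2, 1, h) - _partial(F, p, 1, 2, h),  # dFz/dy - dFy/dz
        _partial(F, p, 0, 2, h) - _partial(F, p, 2, 0, h),  # dFx/dz - dFz/dx
        _partial(F, p, 1, 0, h) - _partial(F, p, 0, 1, h),  # dFy/dx - dFx/dy
    )


def F_a(x, y, z):
    return (y * z, z * x, x * y)


def F_b(x, y, z):
    return (math.sin(x) * math.cosh(y), -math.cos(x) * math.sinh(y), 0.0)


if __name__ == "__main__":
    point = (0.3, -1.2, 2.0)
    for F in (F_a, F_b):
        print(divergence(F, point), curl(F, point))
```

Both fields are nonzero at the sample point, yet the estimates of their divergence and curl
vanish to within the finite-difference error; this is exactly why boundary conditions are
needed before the divergence and curl can pin down the field.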

1.6.2 Potentials 

If the curl of a vector field (F) vanishes (everywhere), then F can be written as the gradient
of a scalar potential (V):

$$\nabla \times \mathbf{F} = 0 \iff \mathbf{F} = -\nabla V. \tag{1.103}$$

(The minus sign is purely conventional.) That’s the essential burden of the following
theorem:

Theorem 1: Curl-less (or “irrotational”) fields. The following conditions are equivalent
(that is, F satisfies one if and only if it satisfies all the others):

(a) $\nabla \times \mathbf{F} = 0$ everywhere.

(b) $\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{F} \cdot d\mathbf{l}$ is independent of path, for any given end points.

(c) $\oint \mathbf{F} \cdot d\mathbf{l} = 0$ for any closed loop.

(d) F is the gradient of some scalar, $\mathbf{F} = -\nabla V$.

$^8$ In some textbook problems the charge itself extends to infinity (we speak, for instance, of the electric field of
an infinite plane, or the magnetic field of an infinite wire). In such cases the normal boundary conditions do not
apply, and one must invoke symmetry arguments to determine the fields uniquely.

The scalar potential is not unique: any constant can be added to V with impunity, since
this will not affect its gradient.

If the divergence of a vector field (F) vanishes (everywhere), then F can be expressed
as the curl of a vector potential (A):

$$\nabla \cdot \mathbf{F} = 0 \iff \mathbf{F} = \nabla \times \mathbf{A}. \tag{1.104}$$

That’s the main conclusion of the following theorem:

Theorem 2: Divergence-less (or “solenoidal”) fields. The following conditions are equivalent:

(a) $\nabla \cdot \mathbf{F} = 0$ everywhere.

(b) $\int \mathbf{F} \cdot d\mathbf{a}$ is independent of surface, for any given boundary line.

(c) $\oint \mathbf{F} \cdot d\mathbf{a} = 0$ for any closed surface.

(d) F is the curl of some vector, $\mathbf{F} = \nabla \times \mathbf{A}$.

The vector potential is not unique: the gradient of any scalar function can be added to A
without affecting the curl, since the curl of a gradient is zero.

You should by now be able to prove all the connections in these theorems, save for
the ones that say (a), (b), or (c) implies (d). Those are more subtle, and will come later.
Incidentally, in all cases (whatever its curl and divergence may be) a vector field F can be
written as the gradient of a scalar plus the curl of a vector:

$$\mathbf{F} = -\nabla V + \nabla \times \mathbf{A} \quad \text{(always)}. \tag{1.105}$$
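
Condition (b) of Theorem 1 can be illustrated numerically with the curl-free field
$\mathbf{F} = yz\,\hat{\mathbf{x}} + zx\,\hat{\mathbf{y}} + xy\,\hat{\mathbf{z}}$ from Sect. 1.6.1. In the sketch below the two paths from (0, 0, 0)
to (1, 2, 3), the helper names, and the chord-midpoint discretization are all hypothetical
choices made for this example. Since this F is the gradient of xyz, both line integrals should
come out close to the endpoint value of xyz.

```python
def F(x, y, z):
    return (y * z, z * x, x * y)


def line_integral(field, path, n=20_000):
    """Approximate the integral of field . dl along path(t), t in [0, 1]."""
    total = 0.0
    prev = path(0.0)
    for i in range(1, n + 1):
        cur = path(i / n)
        mid = [(a + b) / 2.0 for a, b in zip(prev, cur)]
        f_mid = field(*mid)
        # field at the chord midpoint, dotted into the displacement
        total += sum(f * (b - a) for f, a, b in zip(f_mid, prev, cur))
        prev = cur
    return total


def straight(t):
    return (t, 2.0 * t, 3.0 * t)


def curved(t):
    return (t**2, 2.0 * t, 3.0 * t**3)


if __name__ == "__main__":
    print(line_integral(F, straight), line_integral(F, curved))
```

Any other path between the same end points should give the same number, which is what makes a
scalar potential well defined for such a field.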



Problem 1.49

(a) Let $\mathbf{F}_1 = x^2\,\hat{\mathbf{z}}$ and $\mathbf{F}_2 = x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}$. Calculate the divergence and curl of $\mathbf{F}_1$ and $\mathbf{F}_2$.
Which one can be written as the gradient of a scalar? Find a scalar potential that does the job.
Which one can be written as the curl of a vector? Find a suitable vector potential.

(b) Show that $\mathbf{F}_3 = yz\,\hat{\mathbf{x}} + zx\,\hat{\mathbf{y}} + xy\,\hat{\mathbf{z}}$ can be written both as the gradient of a scalar and as
the curl of a vector. Find scalar and vector potentials for this function.

Problem 1.50 For Theorem 1 show that (d) ⇒ (a), (a) ⇒ (c), (c) ⇒ (b), (b) ⇒ (c), and
(c) ⇒ (a).

Problem 1.51 For Theorem 2 show that (d) ⇒ (a), (a) ⇒ (c), (c) ⇒ (b), (b) ⇒ (c), and
(c) ⇒ (a).

Problem 1.52

(a) Which of the vectors in Problem 1.15 can be expressed as the gradient of a scalar? Find a 
scalar function that does the job. 

(b) Which can be expressed as the curl of a vector? Find such a vector. 



More Problems on Chapter 1 

Problem 1.53 Check the divergence theorem for the function

$$\mathbf{v} = r^2 \cos\theta\,\hat{\mathbf{r}} + r^2 \cos\phi\,\hat{\boldsymbol{\theta}} - r^2 \cos\theta \sin\phi\,\hat{\boldsymbol{\phi}},$$

using as your volume one octant of the sphere of radius R (Fig. 1.48). Make sure you include
the entire surface. [Answer: $\pi R^4/4$]

Problem 1.54 Check Stokes’ theorem using the function $\mathbf{v} = ay\,\hat{\mathbf{x}} + bx\,\hat{\mathbf{y}}$ (a and b are constants)
and the circular path of radius R, centered at the origin in the xy plane. [Answer: $\pi R^2(b - a)$]

Problem 1.55 Compute the line integral of

$$\mathbf{v} = 6\,\hat{\mathbf{x}} + yz^2\,\hat{\mathbf{y}} + (3y + z)\,\hat{\mathbf{z}}$$

along the triangular path shown in Fig. 1.49. Check your answer using Stokes’ theorem.
[Answer: 8/3]

Problem 1.56 Compute the line integral of

$$\mathbf{v} = (r \cos^2\theta)\,\hat{\mathbf{r}} - (r \cos\theta \sin\theta)\,\hat{\boldsymbol{\theta}} + 3r\,\hat{\boldsymbol{\phi}}$$

around the path shown in Fig. 1.50 (the points are labeled by their Cartesian coordinates). Do
it either in cylindrical or in spherical coordinates. Check your answer, using Stokes’ theorem.
[Answer: $3\pi/2$]




Figure 1.48 



Figure 1.49 



Figure 1.50 

Figure 1.51



Figure 1.52 



Problem 1.57 Check Stokes’ theorem for the function $\mathbf{v} = y\,\hat{\mathbf{z}}$, using the triangular surface
shown in Fig. 1.51. [Answer: $a^2$]

Problem 1.58 Check the divergence theorem for the function

$$\mathbf{v} = r^2 \sin\theta\,\hat{\mathbf{r}} + 4r^2 \cos\theta\,\hat{\boldsymbol{\theta}} + r^2 \tan\theta\,\hat{\boldsymbol{\phi}},$$

using the volume of the “ice-cream cone” shown in Fig. 1.52 (the top surface is spherical, with
radius R and centered at the origin). [Answer: $(\pi R^4/12)(2\pi + 3\sqrt{3})$]

Problem 1.59 Here are two cute checks of the fundamental theorems:

(a) Combine Corollary 2 to the gradient theorem with Stokes’ theorem ($\mathbf{v} = \nabla T$, in this case).
Show that the result is consistent with what you already knew about second derivatives.

(b) Combine Corollary 2 to Stokes’ theorem with the divergence theorem. Show that the result
is consistent with what you already knew.

• Problem 1.60 Although the gradient, divergence, and curl theorems are the fundamental integral
theorems of vector calculus, it is possible to derive a number of corollaries from them.
Show that:

(a) $\int_{\mathcal{V}} (\nabla T)\,d\tau = \oint_{\mathcal{S}} T\,d\mathbf{a}$. [Hint: Let $\mathbf{v} = \mathbf{c}T$, where $\mathbf{c}$ is a constant, in the divergence
theorem; use the product rules.]

(b) $\int_{\mathcal{V}} (\nabla \times \mathbf{v})\,d\tau = -\oint_{\mathcal{S}} \mathbf{v} \times d\mathbf{a}$. [Hint: Replace $\mathbf{v}$ by $(\mathbf{v} \times \mathbf{c})$ in the divergence theorem.]

(c) $\int_{\mathcal{V}} [T\nabla^2 U + (\nabla T) \cdot (\nabla U)]\,d\tau = \oint_{\mathcal{S}} (T\nabla U) \cdot d\mathbf{a}$. [Hint: Let $\mathbf{v} = T\nabla U$ in the divergence
theorem.]

(d) $\int_{\mathcal{V}} (T\nabla^2 U - U\nabla^2 T)\,d\tau = \oint_{\mathcal{S}} (T\nabla U - U\nabla T) \cdot d\mathbf{a}$. [Comment: This is known as Green’s
theorem; it follows from (c), which is sometimes called Green’s identity.]

(e) $\int_{\mathcal{S}} \nabla T \times d\mathbf{a} = -\oint_{\mathcal{P}} T\,d\mathbf{l}$. [Hint: Let $\mathbf{v} = \mathbf{c}T$ in Stokes’ theorem.]

Problem 1.61 The integral

$$\mathbf{a} \equiv \int_{\mathcal{S}} d\mathbf{a} \tag{1.106}$$

is sometimes called the vector area of the surface $\mathcal{S}$. If $\mathcal{S}$ happens to be flat, then $|\mathbf{a}|$ is the
ordinary (scalar) area, obviously.

(a) Find the vector area of a hemispherical bowl of radius R.

(b) Show that $\mathbf{a} = 0$ for any closed surface. [Hint: Use Prob. 1.60a.]

(c) Show that $\mathbf{a}$ is the same for all surfaces sharing the same boundary.

(d) Show that

$$\mathbf{a} = \frac{1}{2} \oint \mathbf{r} \times d\mathbf{l}, \tag{1.107}$$

where the integral is around the boundary line. [Hint: One way to do it is to draw the cone
subtended by the loop at the origin. Divide the conical surface up into infinitesimal triangular
wedges, each with vertex at the origin and opposite side $d\mathbf{l}$, and exploit the geometrical
interpretation of the cross product (Fig. 1.8).]

(e) Show that

$$\oint (\mathbf{c} \cdot \mathbf{r})\,d\mathbf{l} = \mathbf{a} \times \mathbf{c}, \tag{1.108}$$

for any constant vector $\mathbf{c}$. [Hint: let $T = \mathbf{c} \cdot \mathbf{r}$ in Prob. 1.60e.]



Problem 1.62

(a) Find the divergence of the function

$$\mathbf{v} = \frac{\hat{\mathbf{r}}}{r}.$$

First compute it directly, as in Eq. 1.84. Test your result using the divergence theorem, as in
Eq. 1.85. Is there a delta function at the origin, as there was for $\hat{\mathbf{r}}/r^2$? What is the general
formula for the divergence of $r^n\,\hat{\mathbf{r}}$? [Answer: $\nabla \cdot (r^n\,\hat{\mathbf{r}}) = (n + 2)\,r^{n-1}$, unless $n = -2$, in
which case it is $4\pi\,\delta^3(\mathbf{r})$]

(b) Find the curl of $r^n\,\hat{\mathbf{r}}$. Test your conclusion using Prob. 1.60b. [Answer: $\nabla \times (r^n\,\hat{\mathbf{r}}) = 0$]



Chapter 2 

Electrostatics 



2.1 The Electric Field 

2.1.1 Introduction 

The fundamental problem electromagnetic theory hopes to solve is this (Fig. 2.1): We have
some electric charges, $q_1, q_2, q_3, \ldots$ (call them source charges); what force do they exert
on another charge, Q (call it the test charge)? The positions of the source charges are given
(as functions of time); the trajectory of the test particle is to be calculated. In general, both
the source charges and the test charge are in motion.

The solution to this problem is facilitated by the principle of superposition, which states
that the interaction between any two charges is completely unaffected by the presence of
others. This means that to determine the force on Q, we can first compute the force $\mathbf{F}_1$, due
to $q_1$ alone (ignoring all the others); then we compute the force $\mathbf{F}_2$, due to $q_2$ alone; and so
on. Finally, we take the vector sum of all these individual forces: $\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3 + \ldots$
Thus, if we can find the force on Q due to a single source charge q, we are, in principle,
done (the rest is just a question of repeating the same operation over and over, and adding
it all up).$^1$

Well, at first sight this sounds very easy: Why don’t I just write down the formula for
the force on Q due to q, and be done with it? I could, and in Chapter 10 I shall, but you
would be shocked to see it at this stage, for not only does the force on Q depend on the
separation distance $\mathscr{r}$ between the charges (Fig. 2.2), it also depends on both their velocities
and on the acceleration of q. Moreover, it is not the position, velocity, and acceleration
of q right now that matter: Electromagnetic “news” travels at the speed of light, so what
concerns Q is the position, velocity, and acceleration q had at some earlier time, when the
message left.

$^1$ The principle of superposition may seem “obvious” to you, but it did not have to be so simple: if the electromagnetic
force were proportional to the square of the total source charge, for instance, the principle of superposition
would not hold, since $(q_1 + q_2)^2 \neq q_1^2 + q_2^2$ (there would be “cross terms” to consider). Superposition is not a
logical necessity, but an experimental fact.

Figure 2.1

Figure 2.2

Therefore, in spite of the fact that the basic question (“What is the force on Q due to
q?”) is easy to state, it does not pay to confront it head on; rather, we shall go at it by
stages. In the meantime, the theory we develop will permit the solution of more subtle
electromagnetic problems that do not present themselves in quite this simple format. To
begin with, we shall consider the special case of electrostatics in which all the source
charges are stationary (though the test charge may be moving).

2.1.2 Coulomb’s Law 

What is the force on a test charge Q due to a single point charge q which is at rest a distance
$\mathscr{r}$ away? The answer (based on experiments) is given by Coulomb’s law:

$$\mathbf{F} = \frac{1}{4\pi\epsilon_0}\,\frac{qQ}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}. \tag{2.1}$$

The constant $\epsilon_0$ is called the permittivity of free space. In SI units, where force is in
newtons (N), distance in meters (m), and charge in coulombs (C),

$$\epsilon_0 = 8.85 \times 10^{-12}\ \frac{\text{C}^2}{\text{N} \cdot \text{m}^2}.$$

In words, the force is proportional to the product of the charges and inversely proportional
to the square of the separation distance. As always (Sect. 1.1.4), $\boldsymbol{\mathscr{r}}$ is the separation vector
from $\mathbf{r}'$ (the location of q) to $\mathbf{r}$ (the location of Q):

$$\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{r}'; \tag{2.2}$$

$\mathscr{r}$ is its magnitude, and $\hat{\boldsymbol{\mathscr{r}}}$ is its direction. The force points along the line from q to Q; it is
repulsive if q and Q have the same sign, and attractive if their signs are opposite.

Coulomb’s law and the principle of superposition constitute the physical input for
electrostatics; the rest, except for some special properties of matter, is mathematical
elaboration of these fundamental rules.
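
As a concrete reading of Eqs. 2.1 and 2.2, here is a minimal sketch in SI units. The function
name `coulomb_force`, its argument order, and the tuple representation of vectors are
assumptions made for this illustration; the value of $\epsilon_0$ is the one quoted above.

```python
import math

EPS0 = 8.85e-12  # permittivity of free space, C^2 / (N m^2)


def coulomb_force(q, Q, r_source, r_field):
    """Force on test charge Q at r_field due to a point charge q at rest at r_source."""
    # separation vector, Eq. 2.2: from the source point to the field point
    sep = [f - s for f, s in zip(r_field, r_source)]
    dist = math.sqrt(sum(c * c for c in sep))
    if dist == 0.0:
        raise ValueError("source and field points coincide")
    # (1 / 4 pi eps0) q Q / dist^2 times the unit vector sep / dist
    scale = q * Q / (4.0 * math.pi * EPS0 * dist**3)
    return tuple(scale * c for c in sep)


if __name__ == "__main__":
    # two 1 microcoulomb charges one meter apart along x
    print(coulomb_force(1e-6, 1e-6, (0.0, 0.0, 0.0), (1.0, 0.0, 0.0)))
```

With like charges the returned force points from q toward Q and beyond (repulsion); flipping
the sign of either charge reverses it.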

Problem 2.1

(a) Twelve equal charges, q, are situated at the corners of a regular 12-sided polygon (for
instance, one on each numeral of a clock face). What is the net force on a test charge Q at the
center?

(b) Suppose one of the 12 q’s is removed (the one at “6 o’clock”). What is the force on Q?
Explain your reasoning carefully.

(c) Now 13 equal charges, q, are placed at the corners of a regular 13-sided polygon. What is
the force on a test charge Q at the center?

(d) If one of the 13 q’s is removed, what is the force on Q? Explain your reasoning.



2.1.3 The Electric Field 



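If we have several point charges $q_1, q_2, \ldots, q_n$, at distances $\mathscr{r}_1, \mathscr{r}_2, \ldots, \mathscr{r}_n$ from Q, the total
force on Q is evidently

$$\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \ldots = \frac{1}{4\pi\epsilon_0}\left(\frac{q_1 Q}{\mathscr{r}_1^2}\,\hat{\boldsymbol{\mathscr{r}}}_1 + \frac{q_2 Q}{\mathscr{r}_2^2}\,\hat{\boldsymbol{\mathscr{r}}}_2 + \ldots\right) = \frac{Q}{4\pi\epsilon_0}\left(\frac{q_1}{\mathscr{r}_1^2}\,\hat{\boldsymbol{\mathscr{r}}}_1 + \frac{q_2}{\mathscr{r}_2^2}\,\hat{\boldsymbol{\mathscr{r}}}_2 + \frac{q_3}{\mathscr{r}_3^2}\,\hat{\boldsymbol{\mathscr{r}}}_3 + \ldots\right),$$

or

$$\mathbf{F} = Q\mathbf{E}, \tag{2.3}$$

where

$$\mathbf{E}(\mathbf{r}) \equiv \frac{1}{4\pi\epsilon_0} \sum_{i=1}^{n} \frac{q_i}{\mathscr{r}_i^2}\,\hat{\boldsymbol{\mathscr{r}}}_i. \tag{2.4}$$

E is called the electric field of the source charges. Notice that it is a function of position ($\mathbf{r}$),
because the separation vectors $\boldsymbol{\mathscr{r}}_i$ depend on the location of the field point P (Fig. 2.3). But
it makes no reference to the test charge Q. The electric field is a vector quantity that varies
from point to point and is determined by the configuration of source charges; physically,
$\mathbf{E}(\mathbf{r})$ is the force per unit charge that would be exerted on a test charge, if you were to place
one at P.

Figure 2.3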
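
Equations 2.3 and 2.4 translate directly into a sum over source charges. The sketch below is
illustrative only: the names `electric_field` and `force_on_test_charge`, and the convention of
passing the sources as a list of (charge, position) pairs, are assumptions of this example.

```python
import math

EPS0 = 8.85e-12  # permittivity of free space, C^2 / (N m^2)


def electric_field(sources, r_field):
    """Eq. 2.4: field at r_field due to sources = [(q_i, (x_i, y_i, z_i)), ...]."""
    E = [0.0, 0.0, 0.0]
    for q, pos in sources:
        sep = [f - s for f, s in zip(r_field, pos)]  # separation vector for charge i
        dist = math.sqrt(sum(c * c for c in sep))
        for k in range(3):
            E[k] += q * sep[k] / (4.0 * math.pi * EPS0 * dist**3)
    return tuple(E)


def force_on_test_charge(Q, sources, r_field):
    """Eq. 2.3: F = Q E."""
    return tuple(Q * c for c in electric_field(sources, r_field))


if __name__ == "__main__":
    sources = [(2e-9, (0.0, 0.0, 0.0)), (-1e-9, (1.0, 0.0, 0.0))]
    P = (0.0, 1.0, 0.0)
    print(electric_field(sources, P))
    print(force_on_test_charge(1e-9, sources, P))
```

Note that `electric_field` never sees Q: the field depends only on the source configuration and
the field point, as the text emphasizes.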

What exactly is an electric field? I have deliberately begun with what you might call 
the “minimal” interpretation of E, as an intermediate step in the calculation of electric 
forces. But I encourage you to think of the field as a “real” physical entity, filling the 
space in the neighborhood of any electric charge. Maxwell himself came to believe that 
electric and magnetic fields represented actual stresses and strains in an invisible primordial 
jellylike “ether.” Special relativity has forced us to abandon the notion of ether, and with it 
Maxwell’s mechanical interpretation of electromagnetic fields. (It is even possible, though 
cumbersome, to formulate classical electrodynamics as an “action-at-a-distance” theory, 
and dispense with the field concept altogether.) I can’t tell you, then, what a field is — only 
how to calculate it and what it can do for you once you’ve got it. 

Problem 2.2 

(a) Find the electric field (magnitude and direction) a distance z above the midpoint between
two equal charges, q, a distance d apart (Fig. 2.4). Check that your result is consistent with
what you’d expect when $z \gg d$.

(b) Repeat part (a), only this time make the right-hand charge $-q$ instead of $+q$.




Figure 2.4 Figure 2.5 



2.1.4 Continuous Charge Distributions 

Our definition of the electric field (Eq. 2.4) assumes that the source of the field is a collection
of discrete point charges $q_i$. If, instead, the charge is distributed continuously over some
region, the sum becomes an integral (Fig. 2.5a):

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{1}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,dq. \tag{2.5}$$

If the charge is spread out along a line (Fig. 2.5b), with charge-per-unit-length $\lambda$, then
$dq = \lambda\,dl'$ (where $dl'$ is an element of length along the line); if the charge is smeared
out over a surface (Fig. 2.5c), with charge-per-unit-area $\sigma$, then $dq = \sigma\,da'$ (where $da'$
is an element of area on the surface); and if the charge fills a volume (Fig. 2.5d), with
charge-per-unit-volume $\rho$, then $dq = \rho\,d\tau'$ (where $d\tau'$ is an element of volume):

$$dq \rightarrow \lambda\,dl' \sim \sigma\,da' \sim \rho\,d\tau'.$$

Thus the electric field of a line charge is

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int_{\mathcal{P}} \frac{\lambda(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,dl'; \tag{2.6}$$

for a surface charge,

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int_{\mathcal{S}} \frac{\sigma(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,da'; \tag{2.7}$$

and for a volume charge,

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int_{\mathcal{V}} \frac{\rho(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,d\tau'. \tag{2.8}$$

Equation 2.8 itself is often referred to as “Coulomb’s law,” because it is such a short
step from the original (2.1), and because a volume charge is in a sense the most general
and realistic case. Please note carefully the meaning of $\boldsymbol{\mathscr{r}}$ in these formulas. Originally, in
Eq. 2.4, $\boldsymbol{\mathscr{r}}_i$ stood for the vector from the source charge $q_i$ to the field point $\mathbf{r}$. Correspondingly,
in Eqs. 2.5-2.8, $\boldsymbol{\mathscr{r}}$ is the vector from $dq$ (therefore from $dl'$, $da'$, or $d\tau'$) to the field point
$\mathbf{r}$.$^2$


Example 2.1

Find the electric field a distance z above the midpoint of a straight line segment of length 2L,
which carries a uniform line charge $\lambda$ (Fig. 2.6).

Solution: It is advantageous to chop the line up into symmetrically placed pairs (at $\pm x$), for
then the horizontal components of the two fields cancel, and the net field of the pair is

$$d\mathbf{E} = 2\,\frac{1}{4\pi\epsilon_0}\left(\frac{\lambda\,dx}{\mathscr{r}^2}\right)\cos\theta\,\hat{\mathbf{z}}.$$


$^2$ Warning: The unit vector $\hat{\boldsymbol{\mathscr{r}}}$ is not constant; its direction depends on the source point $\mathbf{r}'$, and hence it cannot be
taken outside the integrals 2.5-2.8. In practice, you must work with Cartesian components ($\hat{\mathbf{x}}, \hat{\mathbf{y}}, \hat{\mathbf{z}}$ are constant,
and do come out), even if you use curvilinear coordinates to perform the integration.

Here $\cos\theta = z/\mathscr{r}$, $\mathscr{r} = \sqrt{z^2 + x^2}$, and x runs from 0 to L:

$$E = \frac{1}{4\pi\epsilon_0} \int_0^L \frac{2\lambda z}{(z^2 + x^2)^{3/2}}\,dx = \frac{2\lambda z}{4\pi\epsilon_0}\left[\frac{x}{z^2\sqrt{z^2 + x^2}}\right]_0^L = \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda L}{z\sqrt{z^2 + L^2}},$$

and it aims in the z-direction.

For points far from the line ($z \gg L$), this result simplifies:

$$E \cong \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda L}{z^2},$$

which makes sense: From far away the line “looks” like a point charge $q = 2\lambda L$, so the field
reduces to that of a point charge, $q/(4\pi\epsilon_0 z^2)$. In the limit $L \rightarrow \infty$, on the other hand, we obtain
the field of an infinite straight wire:

$$E = \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda}{z};$$

or, more generally,

$$E = \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda}{s}, \tag{2.9}$$

where s is the distance from the wire.
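
The closed form of Ex. 2.1 and its far-field limit can be checked against a direct numerical
sum over the line. This is a sketch under stated assumptions: the function names, the midpoint
rule, and the variable name `lam` (standing in for $\lambda$, since `lambda` is reserved in Python) are
all choices made for this illustration.

```python
import math

EPS0 = 8.85e-12  # permittivity of free space, C^2 / (N m^2)


def field_closed_form(z, L, lam):
    """Result of Ex. 2.1: vertical field a distance z above the midpoint."""
    return 2.0 * lam * L / (4.0 * math.pi * EPS0 * z * math.sqrt(z**2 + L**2))


def field_numeric(z, L, lam, n=20_000):
    """Sum the vertical components dq cos(theta) / sep^2 over the segment -L..L."""
    dx = 2.0 * L / n
    total = 0.0
    for i in range(n):
        x = -L + (i + 0.5) * dx
        sep = math.sqrt(z**2 + x**2)
        total += lam * dx / sep**2 * (z / sep)
    return total / (4.0 * math.pi * EPS0)


if __name__ == "__main__":
    z, L, lam = 0.5, 1.0, 1e-9
    print(field_numeric(z, L, lam), field_closed_form(z, L, lam))
```

Summing over the whole segment from $-L$ to $L$ is equivalent to the pairing trick used in the
solution; the horizontal components cancel by symmetry, so only the vertical ones are added.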



Problem 2.3 Find the electric field a distance z above one end of a straight line segment of
length L (Fig. 2.7), which carries a uniform line charge $\lambda$. Check that your formula is consistent
with what you would expect for the case $z \gg L$.

Figure 2.7

Figure 2.8

Figure 2.9

Problem 2.4 Find the electric field a distance z above the center of a square loop (side a)
carrying uniform line charge $\lambda$ (Fig. 2.8). [Hint: Use the result of Ex. 2.1.]

Problem 2.5 Find the electric field a distance z above the center of a circular loop of radius r
(Fig. 2.9), which carries a uniform line charge $\lambda$.

Problem 2.6 Find the electric field a distance z above the center of a flat circular disk of radius
R (Fig. 2.10), which carries a uniform surface charge $\sigma$. What does your formula give in the
limit $R \rightarrow \infty$? Also check the case $z \gg R$.

Problem 2.7 Find the electric field a distance z from the center of a spherical surface of radius
R (Fig. 2.11), which carries a uniform charge density $\sigma$. Treat the case $z < R$ (inside) as well
as $z > R$ (outside). Express your answers in terms of the total charge q on the sphere. [Hint:
Use the law of cosines to write $\mathscr{r}$ in terms of R and $\theta$. Be sure to take the positive square root:
$\sqrt{R^2 + z^2 - 2Rz} = (R - z)$ if $R > z$, but it’s $(z - R)$ if $R < z$.]

Problem 2.8 Use your result in Prob. 2.7 to find the field inside and outside a sphere of radius
R, which carries a uniform volume charge density $\rho$. Express your answers in terms of the
total charge of the sphere, q. Draw a graph of $|\mathbf{E}|$ as a function of the distance from the center.

2.2 Divergence and Curl of Electrostatic Fields

2.2.1 Field Lines, Flux, and Gauss’s Law

In principle, we are done with the subject of electrostatics. Equation 2.8 tells us how to
compute the field of a charge distribution, and Eq. 2.3 tells us what the force on a charge Q
placed in this field will be. Unfortunately, as you may have discovered in working Prob. 2.7,
the integrals involved in computing E can be formidable, even for reasonably simple charge
distributions. Much of the rest of electrostatics is devoted to assembling a bag of tools and
tricks for avoiding these integrals. It all begins with the divergence and curl of E. I shall
calculate the divergence of E directly from Eq. 2.8, in Sect. 2.2.2, but first I want to show
you a more qualitative, and perhaps more illuminating, intuitive approach.

Let’s begin with the simplest possible case: a single point charge q, situated at the
origin:

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}. \tag{2.10}$$

To get a “feel” for this field, I might sketch a few representative vectors, as in Fig. 2.12a.
Because the field falls off like $1/r^2$, the vectors get shorter as you go farther away from the
origin; they always point radially outward. But there is a nicer way to represent this field,
and that’s to connect up the arrows, to form field lines (Fig. 2.12b). You might think that I
have thereby thrown away information about the strength of the field, which was contained
in the length of the arrows. But actually I have not. The magnitude of the field is indicated
by the density of the field lines: it’s strong near the center where the field lines are close
together, and weak farther out, where they are relatively far apart.

In truth, the field-line diagram is deceptive, when I draw it on a two-dimensional surface,
for the density of lines passing through a circle of radius r is the total number divided by the
circumference ($n/2\pi r$), which goes like $(1/r)$, not $(1/r^2)$. But if you imagine the model in
three dimensions (a pincushion with needles sticking out in all directions), then the density
of lines is the total number divided by the area of the sphere ($n/4\pi r^2$), which does go like
$(1/r^2)$.




Figure 2.12

Figure 2.13 (Equal but opposite charges)

Such diagrams are also convenient for representing more complicated fields. Of course,
the number of lines you draw depends on how energetic you are (and how sharp your pencil
is), though you ought to include enough to get an accurate sense of the field, and you must
be consistent: If charge q gets 8 lines, then 2q deserves 16. And you must space them
fairly: they emanate from a point charge symmetrically in all directions. Field lines begin
on positive charges and end on negative ones; they cannot simply terminate in midair, though
they may extend out to infinity. Moreover, field lines can never cross; at the intersection,
the field would have two different directions at once! With all this in mind, it is easy to
sketch the field of any simple configuration of point charges: Begin by drawing the lines
in the neighborhood of each charge, and then connect them up or extend them to infinity
(Figs. 2.13 and 2.14).

Figure 2.14

Figure 2.15



In this model the flux of E through a surface $\mathcal{S}$,

$$\Phi_E \equiv \int_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a}, \tag{2.11}$$

is a measure of the “number of field lines” passing through $\mathcal{S}$. I put this in quotes because of
course we can only draw a representative sample of the field lines; the total number would
be infinite. But for a given sampling rate the flux is proportional to the number of lines
drawn, because the field strength, remember, is proportional to the density of field lines
(the number per unit area), and hence $\mathbf{E} \cdot d\mathbf{a}$ is proportional to the number of lines passing
through the infinitesimal area $d\mathbf{a}$. (The dot product picks out the component of $d\mathbf{a}$ along
the direction of E, as indicated in Fig. 2.15. It is only the area in the plane perpendicular
to E that we have in mind when we say that the density of field lines is the number per unit
area.)

This suggests that the flux through any closed surface is a measure of the total charge
inside. For the field lines that originate on a positive charge must either pass out through
the surface or else terminate on a negative charge inside (Fig. 2.16a). On the other hand, a
charge outside the surface will contribute nothing to the total flux, since its field lines pass
in one side and out the other (Fig. 2.16b). This is the essence of Gauss’s law. Now let’s
make it quantitative.




(a) 



(b) 



Figure 2.16 

In the case of a point charge q at the origin, the flux of E through a sphere of radius r is

$$\oint \mathbf{E} \cdot d\mathbf{a} = \int \frac{1}{4\pi\epsilon_0}\left(\frac{q}{r^2}\,\hat{\mathbf{r}}\right) \cdot \left(r^2 \sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}}\right) = \frac{1}{\epsilon_0}\,q. \tag{2.12}$$

Notice that the radius of the sphere cancels out, for while the surface area goes up as $r^2$, the
field goes down as $1/r^2$, and so the product is constant. In terms of the field-line picture, this
makes good sense, since the same number of field lines passes through any sphere centered
at the origin, regardless of its size. In fact, it didn’t have to be a sphere; any closed surface,
whatever its shape, would trap the same number of field lines. Evidently the flux through
any surface enclosing the charge is $q/\epsilon_0$.
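
The statement that the surface need not be a sphere can be tested numerically. The sketch
below is an illustration, not a proof: it assumes a cube centered on the origin (the function
name `flux_through_cube`, the grid size `n`, and the midpoint rule are choices made here) and
adds up $\mathbf{E} \cdot d\mathbf{a}$ over its six faces for a point charge placed anywhere. For a charge inside the
cube the result should be close to $q/\epsilon_0$; for a charge outside it should be close to zero.

```python
import math

EPS0 = 8.85e-12  # permittivity of free space, C^2 / (N m^2)


def flux_through_cube(q, charge_pos, half_side=1.0, n=100):
    """Midpoint-rule flux of a point charge's field through a cube centered at the origin."""
    h = 2.0 * half_side / n
    total = 0.0
    for axis in range(3):
        for sign in (-1.0, 1.0):  # the two faces perpendicular to this axis
            for i in range(n):
                for j in range(n):
                    point = [0.0, 0.0, 0.0]
                    point[axis] = sign * half_side
                    point[(axis + 1) % 3] = -half_side + (i + 0.5) * h
                    point[(axis + 2) % 3] = -half_side + (j + 0.5) * h
                    d = [point[k] - charge_pos[k] for k in range(3)]
                    dist = math.sqrt(d[0] ** 2 + d[1] ** 2 + d[2] ** 2)
                    # component of E along the outward normal (sign * unit vector along axis)
                    e_normal = sign * q * d[axis] / (4.0 * math.pi * EPS0 * dist**3)
                    total += e_normal * h * h
    return total


if __name__ == "__main__":
    q = 1e-9
    print(flux_through_cube(q, (0.3, -0.2, 0.1)) * EPS0 / q)  # charge inside
    print(flux_through_cube(q, (2.0, 0.0, 0.0)) * EPS0 / q)   # charge outside
```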

Now suppose that instead of a single charge at the origin, we have a bunch of charges
scattered about. According to the principle of superposition, the total field is the (vector)
sum of all the individual fields:

$$\mathbf{E} = \sum_{i=1}^{n} \mathbf{E}_i.$$

The flux through a surface that encloses them all, then, is

$$\oint \mathbf{E} \cdot d\mathbf{a} = \sum_{i=1}^{n} \left(\oint \mathbf{E}_i \cdot d\mathbf{a}\right) = \sum_{i=1}^{n} \left(\frac{1}{\epsilon_0}\,q_i\right).$$

For any closed surface, then,

$$\oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}, \tag{2.13}$$

where $Q_{\text{enc}}$ is the total charge enclosed within the surface. This is the quantitative statement
of Gauss’s law. Although it contains no information that was not already present in
Coulomb’s law and the principle of superposition, it is of almost magical power, as you will
see in Sect. 2.2.3. Notice that it all hinges on the $1/r^2$ character of Coulomb’s law; without
that the crucial cancellation of the r’s in Eq. 2.12 would not take place, and the total flux
of E would depend on the surface chosen, not merely on the total charge enclosed. Other
$1/r^2$ forces (I am thinking particularly of Newton’s law of universal gravitation) will obey
“Gauss’s laws” of their own, and the applications we develop here carry over directly.

As it stands, Gauss’s law is an integral equation, but we can readily turn it into a
differential one, by applying the divergence theorem:

$$\oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \int_{\mathcal{V}} (\nabla \cdot \mathbf{E})\,d\tau.$$

Rewriting $Q_{\text{enc}}$ in terms of the charge density $\rho$, we have

$$Q_{\text{enc}} = \int_{\mathcal{V}} \rho\,d\tau.$$

So Gauss’s law becomes

$$\int_{\mathcal{V}} (\nabla \cdot \mathbf{E})\,d\tau = \int_{\mathcal{V}} \left(\frac{\rho}{\epsilon_0}\right) d\tau.$$

And since this holds for any volume, the integrands must be equal:

$$\nabla \cdot \mathbf{E} = \frac{1}{\epsilon_0}\,\rho. \tag{2.14}$$

Equation 2.14 carries the same message as Eq. 2.13; it is Gauss’s law in differential
form. The differential version is tidier, but the integral form has the advantage in that it
accommodates point, line, and surface charges more naturally.



Problem 2.9 Suppose the electric field in some region is found to be $\mathbf{E} = kr^3\,\hat{\mathbf{r}}$, in spherical
coordinates (k is some constant).

(a) Find the charge density $\rho$.

(b) Find the total charge contained in a sphere of radius R, centered at the origin. (Do it two
different ways.)

Problem 2.10 A charge q sits at the back corner of a cube, as shown in Fig. 2.17. What is the
flux of E through the shaded side?




2.2.2 The Divergence of E 

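Let’s go back, now, and calculate the divergence of E directly from Eq. 2.8:

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int_{\text{all space}} \frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,\rho(\mathbf{r}')\,d\tau'. \tag{2.15}$$

(Originally the integration was over the volume occupied by the charge, but I may as
well extend it to all space, since $\rho = 0$ in the exterior region anyway.) Noting that the
r-dependence is contained in $\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{r}'$, we have

$$\nabla \cdot \mathbf{E} = \frac{1}{4\pi\epsilon_0} \int \nabla \cdot \left(\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) \rho(\mathbf{r}')\,d\tau'.$$

This is precisely the divergence we calculated in Eq. 1.100:

$$\nabla \cdot \left(\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) = 4\pi\,\delta^3(\boldsymbol{\mathscr{r}}).$$

Thus

$$\nabla \cdot \mathbf{E} = \frac{1}{4\pi\epsilon_0} \int 4\pi\,\delta^3(\mathbf{r} - \mathbf{r}')\,\rho(\mathbf{r}')\,d\tau' = \frac{1}{\epsilon_0}\,\rho(\mathbf{r}), \tag{2.16}$$

which is Gauss’s law in differential form (2.14). To recover the integral form (2.13), we
run the previous argument in reverse: integrate over a volume and apply the divergence
theorem:

$$\int_{\mathcal{V}} \nabla \cdot \mathbf{E}\,d\tau = \oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0} \int_{\mathcal{V}} \rho\,d\tau = \frac{1}{\epsilon_0}\,Q_{\text{enc}}.$$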



2.2.3 Applications of Gauss’s Law 

I must interrupt the theoretical development at this point to show you the extraordinary 
power of Gauss’s law, in integral form. When symmetry permits, it affords by far the 
quickest and easiest way of computing electric fields. I’ll illustrate the method with a series 
of examples. 

Example 2.2 



Find the field outside a uniformly charged solid sphere of radius R and total charge q. 

Solution: Draw a spherical surface at radius r > R (Fig. 2.18); this is called a “Gaussian
surface” in the trade. Gauss’s law says that for this surface (as for any other)

\[ \oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}, \]



and $Q_{\text{enc}} = q$. At first glance this doesn’t seem to get us very far, because the quantity we
want (E) is buried inside the surface integral. Luckily, symmetry allows us to extract E from
under the integral sign: E certainly points radially outward,³ as does $d\mathbf{a}$, so we can drop the
dot product,

\[ \oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \oint_{\mathcal{S}} |\mathbf{E}|\,da, \]

3 If you doubt that E is radial, consider the alternative. Suppose, say, that it points due east, at the “equator.” But 
the orientation of the equator is perfectly arbitrary — nothing is spinning here, so there is no natural “north-south” 
axis — any argument purporting to show that E points east could just as well be used to show it points west, or 
north, or any other direction. The only unique direction on a sphere is radial.

Figure 2.18 (Gaussian surface)



and the magnitude of E is constant over the Gaussian surface, so it comes outside the integral: 



\[ \oint_{\mathcal{S}} |\mathbf{E}|\,da = |\mathbf{E}| \oint_{\mathcal{S}} da = |\mathbf{E}|\,4\pi r^2. \]



Thus

\[ |\mathbf{E}|\,4\pi r^2 = \frac{1}{\epsilon_0}\,q, \]

or

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}. \]



Notice a remarkable feature of this result: The field outside the sphere is exactly the same as 
it would have been if all the charge had been concentrated at the center. 
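
Gauss’s law holds for any closed surface, not only for the convenient spherical one. As a rough numerical illustration (a sketch, not part of the derivation above), the following Python estimates the flux of a point charge’s field through a cube centered on the charge and compares it with $q/\epsilon_0$. The function names, the cube size, and the midpoint-rule grid are all assumptions chosen for this illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def point_charge_field(q, x, y, z):
    """Coulomb field (Ex, Ey, Ez) of a point charge q sitting at the origin."""
    r3 = (x * x + y * y + z * z) ** 1.5
    c = q / (4.0 * math.pi * EPS0 * r3)
    return c * x, c * y, c * z


def flux_through_cube(q, half_side=1.0, n=200):
    """Midpoint-rule estimate of the outward flux of E through a cube of
    side 2*half_side centered on the charge."""
    a = half_side
    h = 2.0 * a / n
    face_flux = 0.0
    for i in range(n):
        u = -a + (i + 0.5) * h
        for j in range(n):
            v = -a + (j + 0.5) * h
            # Face z = +a has outward normal +z, so only Ez contributes.
            _, _, ez = point_charge_field(q, u, v, a)
            face_flux += ez * h * h
    # All six faces are equivalent by symmetry.
    return 6.0 * face_flux


if __name__ == "__main__":
    q = 1e-9
    print(flux_through_cube(q, half_side=0.5), q / EPS0)
```

On the cube the magnitude of E varies from the face centers to the corners, so it cannot be pulled outside the integral; the total flux nevertheless agrees with $q/\epsilon_0$ to within the accuracy of the grid.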



Gauss’s law is always true, but it is not always useful. If $\rho$ had not been uniform (or, at
any rate, not spherically symmetrical), or if I had chosen some other shape for my Gaussian
surface, it would still have been true that the flux of E is $(1/\epsilon_0)q$, but I would not have
been certain that E was in the same direction as $d\mathbf{a}$ and constant in magnitude over the
surface, and without that I could not pull |E| out of the integral. Symmetry is crucial to this
application of Gauss’s law. As far as I know, there are only three kinds of symmetry that 
work: 

1. Spherical symmetry. Make your Gaussian surface a concentric sphere.

2. Cylindrical symmetry. Make your Gaussian surface a coaxial cylinder 
(Fig. 2.19). 

3. Plane symmetry. Use a Gaussian “pillbox,” which straddles the surface 
(Fig. 2.20). 

Although (2) and (3) technically require infinitely long cylinders, and planes extending to 
infinity in all directions, we shall often use them to get approximate answers for “long” 
cylinders or “large” plane surfaces, at points far from the edges.

Example 2.3

A long cylinder (Fig. 2.21) carries a charge density that is proportional to the distance from
the axis: $\rho = ks$, for some constant k. Find the electric field inside this cylinder.



Solution: Draw a Gaussian cylinder of length l and radius s. For this surface, Gauss’s law
states:

\[ \oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}. \]



The enclosed charge is

\[ Q_{\text{enc}} = \int \rho\,d\tau = \int (ks')(s'\,ds'\,d\phi\,dz) = 2\pi k l \int_0^s s'^2\,ds' = \tfrac{2}{3}\pi k l s^3. \]

(I used the volume element appropriate to cylindrical coordinates, Eq. 1.78, and integrated $\phi$
from 0 to $2\pi$, z from 0 to l. I put a prime on the integration variable s', to distinguish it from
the radius s of the Gaussian surface.)



Figure 2.21

Now, symmetry dictates that E must point radially outward, so for the curved portion of the
Gaussian cylinder we have:

\[ \int \mathbf{E} \cdot d\mathbf{a} = \int |\mathbf{E}|\,da = |\mathbf{E}| \int da = |\mathbf{E}|\,2\pi s l, \]

while the two ends contribute nothing (here E is perpendicular to $d\mathbf{a}$). Thus,

\[ |\mathbf{E}|\,2\pi s l = \frac{1}{\epsilon_0}\,\frac{2}{3}\pi k l s^3, \]

or, finally,

\[ \mathbf{E} = \frac{1}{3\epsilon_0}\,k s^2\,\hat{\mathbf{s}}. \]
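
If you want to check the result of Ex. 2.3 numerically, here is a minimal Python sketch that integrates the charge density $\rho = ks'$ over a coaxial cylinder and then applies Gauss’s law. The function names, the sample values of k and s, and the use of a simple midpoint rule are assumptions made for this illustration only.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def enclosed_charge(k, s, length, n=10000):
    """Charge inside a coaxial cylinder of radius s and the given length,
    for rho = k * s', using a midpoint rule in the radial variable s'."""
    h = s / n
    total = 0.0
    for i in range(n):
        sp = (i + 0.5) * h
        # rho times the volume of a thin cylindrical shell of radius sp.
        total += (k * sp) * (2.0 * math.pi * sp * length) * h
    return total


def field_inside(k, s, length=1.0):
    """|E| at radius s from Gauss's law: |E| * 2*pi*s*l = Q_enc / eps0."""
    return enclosed_charge(k, s, length) / (EPS0 * 2.0 * math.pi * s * length)


if __name__ == "__main__":
    k, s = 2e-6, 0.3
    print(field_inside(k, s), k * s * s / (3.0 * EPS0))
```

The length l cancels, as it must: the answer cannot depend on how long a Gaussian cylinder you happen to draw.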



Example 2.4 



An infinite plane carries a uniform surface charge $\sigma$. Find its electric field.

Solution: Draw a “Gaussian pillbox,” extending equal distances above and below the plane 
(Fig. 2.22). Apply Gauss’s law to this surface: 



\[ \oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}. \]

In this case, $Q_{\text{enc}} = \sigma A$, where A is the area of the lid of the pillbox. By symmetry, E points
away from the plane (upward for points above, downward for points below). Thus, the top and
bottom surfaces yield

\[ \int \mathbf{E} \cdot d\mathbf{a} = 2A|\mathbf{E}|, \]




Figure 2.22

whereas the sides contribute nothing. Thus

\[ 2A|\mathbf{E}| = \frac{1}{\epsilon_0}\,\sigma A, \]

or

\[ \mathbf{E} = \frac{\sigma}{2\epsilon_0}\,\hat{\mathbf{n}}, \tag{2.17} \]



where n is a unit vector pointing away from the surface. In Prob. 2.6, you obtained this same 
result by a much more laborious method. 



It seems surprising, at first, that the field of an infinite plane is independent of how far away
you are. What about the $1/r^2$ in Coulomb’s law? Well, the point is that as you move farther
and farther away from the plane, more and more charge comes into your “field of view” (a
cone shape extending out from your eye), and this compensates for the diminishing influence
of any particular piece. The electric field of a sphere falls off like $1/r^2$; the electric field of an
infinite line falls off like $1/r$; and the electric field of an infinite plane does not fall off at all.
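
One way to see the “field of view” argument at work is to look at a finite disk. The sketch below assumes the standard on-axis result for a uniformly charged disk of radius R, $E = (\sigma/2\epsilon_0)\,[1 - z/\sqrt{z^2 + R^2}]$ (the kind of result Prob. 2.6 asks for), and compares it with the infinite-plane value. The function names and sample numbers are illustrative assumptions.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def disk_axis_field(sigma, radius, z):
    """On-axis field magnitude a height z above the center of a uniformly
    charged disk (assumed closed form; see the lead-in)."""
    return sigma / (2.0 * EPS0) * (1.0 - z / math.sqrt(z * z + radius * radius))


def plane_field(sigma):
    """Field magnitude of an infinite plane, Eq. 2.17."""
    return sigma / (2.0 * EPS0)


if __name__ == "__main__":
    sigma = 1e-6
    for z in (0.001, 0.01, 0.1, 1.0, 10.0):
        ratio = disk_axis_field(sigma, 1.0, z) / plane_field(sigma)
        print(f"z/R = {z:6.3f}   E_disk / E_plane = {ratio:.5f}")
```

Close to the disk (z much smaller than R) the ratio approaches 1, so the disk “looks” like an infinite plane; far away (z much larger than R) it falls toward $R^2/2z^2$, which is the point-charge behavior.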



Although the direct use of Gauss’s law to compute electric fields is limited to cases of
spherical, cylindrical, and planar symmetry, we can put together combinations of objects 
possessing such symmetry, even though the arrangement as a whole is not symmetrical. 
For example, invoking the principle of superposition, we could find the field in the vicinity 
of two uniformly charged parallel cylinders, or a sphere near an infinite charged plane. 

Example 2.5 

Two infinite parallel planes carry equal but opposite uniform charge densities $\pm\sigma$ (Fig. 2.23).
Find the field in each of the three regions: (i) to the left of both, (ii) between them, (iii) to the
right of both.

Solution: The left plate produces a field $(1/2\epsilon_0)\sigma$ which points away from it (Fig. 2.24): to
the left in region (i) and to the right in regions (ii) and (iii). The right plate, being negatively
charged, produces a field $(1/2\epsilon_0)\sigma$, which points toward it: to the right in regions (i) and
(ii) and to the left in region (iii). The two fields cancel in regions (i) and (iii); they conspire
in region (ii). Conclusion: The field is $(1/\epsilon_0)\sigma$, and points to the right, between the planes;
elsewhere it is zero.
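
The superposition argument of Ex. 2.5 is easy to mimic in a few lines of Python. This is only a sketch: the planes are taken perpendicular to the x axis, and the function names and plane positions are assumptions introduced for the illustration.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def plane_field_x(sigma, x_plane, x):
    """x-component of the field of an infinite plane at x = x_plane carrying
    uniform surface charge sigma: magnitude |sigma|/(2*eps0), pointing away
    from the plane when sigma > 0 and toward it when sigma < 0."""
    direction = 1.0 if x > x_plane else -1.0
    return direction * sigma / (2.0 * EPS0)


def two_plane_field_x(sigma, x_left, x_right, x):
    """Superpose a +sigma plane at x_left and a -sigma plane at x_right."""
    return plane_field_x(sigma, x_left, x) + plane_field_x(-sigma, x_right, x)


if __name__ == "__main__":
    sigma = 1e-6
    for label, x in (("(i)", -1.0), ("(ii)", 0.5), ("(iii)", 2.0)):
        print(label, two_plane_field_x(sigma, 0.0, 1.0, x))
```

With the planes at x = 0 and x = 1, the three sample points fall in regions (i), (ii), and (iii) respectively.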



Figure 2.23 (planes carrying $+\sigma$ and $-\sigma$, with regions (i), (ii), and (iii))



Figure 2.24

Problem 2.11 Use Gauss’s law to find the electric field inside and outside a spherical shell of
radius R, which carries a uniform surface charge density $\sigma$. Compare your answer to Prob. 2.7.

Problem 2.12 Use Gauss’s law to find the electric field inside a uniformly charged sphere
(charge density $\rho$). Compare your answer to Prob. 2.8.

Problem 2.13 Find the electric field a distance s from an infinitely long straight wire, which
carries a uniform line charge $\lambda$. Compare Eq. 2.9.

Problem 2.14 Find the electric field inside a sphere which carries a charge density proportional
to the distance from the origin, $\rho = kr$, for some constant k. [Hint: This charge density is not
uniform, and you must integrate to get the enclosed charge.]

Problem 2.15 A hollow spherical shell carries charge density

\[ \rho = \frac{k}{r^2} \]

in the region a < r < b (Fig. 2.25). Find the electric field in the three regions: (i) r < a, (ii)
a < r < b, (iii) r > b. Plot |E| as a function of r.

Problem 2.16 A long coaxial cable (Fig. 2.26) carries a uniform volume charge density $\rho$ on
the inner cylinder (radius a), and a uniform surface charge density on the outer cylindrical
shell (radius b). This surface charge is negative and of just the right magnitude so that the
cable as a whole is electrically neutral. Find the electric field in each of the three regions: (i)
inside the inner cylinder (s < a), (ii) between the cylinders (a < s < b), (iii) outside the cable
(s > b). Plot |E| as a function of s.

Problem 2.17 An infinite plane slab, of thickness 2d, carries a uniform volume charge density
$\rho$ (Fig. 2.27). Find the electric field, as a function of y, where y = 0 at the center. Plot E
versus y, calling E positive when it points in the +y direction and negative when it points in
the −y direction.

Problem 2.18 Two spheres, each of radius R and carrying uniform charge densities $+\rho$ and
$-\rho$, respectively, are placed so that they partially overlap (Fig. 2.28). Call the vector from the
positive center to the negative center d. Show that the field in the region of overlap is constant,
and find its value. [Hint: Use the answer to Prob. 2.12.]




Figure 2.25 



Figure 2.26 




2.2.4 The Curl of E 

I’ll calculate the curl of E, as I did the divergence in Sect. 2.2.1, by studying first the simplest
possible configuration: a point charge at the origin. In this case

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}. \]

Now, a glance at Fig. 2.12 should convince you that the curl of this field has to be zero, but 
I suppose we ought to come up with something a little more rigorous than that. What if we 
calculate the line integral of this field from some point a to some other point b (Fig. 2.29): 

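\[ \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l}. \]

In spherical coordinates, $d\mathbf{l} = dr\,\hat{\mathbf{r}} + r\,d\theta\,\hat{\boldsymbol{\theta}} + r\sin\theta\,d\phi\,\hat{\boldsymbol{\phi}}$, so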



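\[ \mathbf{E} \cdot d\mathbf{l} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,dr. \]

Therefore,

\[ \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l} = \frac{1}{4\pi\epsilon_0} \int_{\mathbf{a}}^{\mathbf{b}} \frac{q}{r^2}\,dr = \left. \frac{-1}{4\pi\epsilon_0}\,\frac{q}{r} \right|_{r_a}^{r_b} = \frac{1}{4\pi\epsilon_0} \left( \frac{q}{r_a} - \frac{q}{r_b} \right), \]

where $r_a$ is the distance from the origin to the point a and $r_b$ is the distance to b. The
integral around a closed path is evidently zero (for then $r_a = r_b$):

\[ \oint \mathbf{E} \cdot d\mathbf{l} = 0, \tag{2.19} \]

and hence, applying Stokes’ theorem,

\[ \nabla \times \mathbf{E} = 0. \tag{2.20} \]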



Now, I proved Eqs. 2.19 and 2.20 only for the field of a single point charge at the origin,
but these results make no reference to what is, after all, a perfectly arbitrary choice of
coordinates; they also hold no matter where the charge is located. Moreover, if we have
many charges, the principle of superposition states that the total field is a vector sum of
their individual fields:

\[ \mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2 + \dots, \]

so

\[ \nabla \times \mathbf{E} = \nabla \times (\mathbf{E}_1 + \mathbf{E}_2 + \dots) = (\nabla \times \mathbf{E}_1) + (\nabla \times \mathbf{E}_2) + \dots = 0. \]

Thus, Eqs. 2.19 and 2.20 hold for any static charge distribution whatever.
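
The statement that the line integral of a point charge’s field depends only on the endpoint distances $r_a$ and $r_b$ can be tested numerically. The following Python sketch integrates E along two different paths between the same endpoints; the two sample paths, the helper names, and the chord-midpoint integration rule are assumptions made for this illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def coulomb_field(q, p):
    """Field of a point charge q at the origin, evaluated at p = (x, y, z)."""
    x, y, z = p
    r3 = (x * x + y * y + z * z) ** 1.5
    c = q / (4.0 * math.pi * EPS0 * r3)
    return (c * x, c * y, c * z)


def line_integral(q, path, n=20000):
    """Estimate the integral of E . dl along path(t), 0 <= t <= 1, by summing
    E(midpoint) . (chord) over n short straight segments."""
    total = 0.0
    prev = path(0.0)
    for i in range(1, n + 1):
        cur = path(i / n)
        mid = tuple(0.5 * (a + b) for a, b in zip(prev, cur))
        e = coulomb_field(q, mid)
        total += sum(ec * (b - a) for ec, a, b in zip(e, prev, cur))
        prev = cur
    return total


def straight_path(t):
    """Straight line from a = (1, 0, 0) to b = (0, 2, 0)."""
    return (1.0 - t, 2.0 * t, 0.0)


def detour_path(t):
    """Same endpoints, but bulging out of the xy plane along the way."""
    return (1.0 - t, 2.0 * t, math.sin(math.pi * t))


if __name__ == "__main__":
    q = 1e-9
    print(line_integral(q, straight_path), line_integral(q, detour_path))
```

Here $r_a = 1$ and $r_b = 2$, so both integrals should agree with $(q/4\pi\epsilon_0)(1/r_a - 1/r_b)$ up to discretization error.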



Problem 2.19 Calculate $\nabla \times \mathbf{E}$ directly from Eq. 2.8, by the method of Sect. 2.2.2. Refer to
Prob. 1.62 if you get stuck.



2.3 Electric Potential

2.3.1 Introduction to Potential 

The electric field E is not just any old vector function; it is a very special kind of vector
function, one whose curl is always zero. $\mathbf{E} = y\,\hat{\mathbf{x}}$, for example, could not possibly be
an electrostatic field; no set of charges, regardless of their sizes and positions, could ever
produce such a field. In this section we’re going to exploit this special property of electric
fields to reduce a vector problem (finding E) down to a much simpler scalar problem. The
first theorem in Sect. 1.6.2 asserts that any vector whose curl is zero is equal to the gradient
of some scalar. What I’m going to do now amounts to a proof of that claim, in the context
of electrostatics.

Figure 2.30 (two paths, (i) and (ii), from a to b)
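
To see concretely why $\mathbf{E} = y\,\hat{\mathbf{x}}$ cannot be an electrostatic field, one can estimate its curl numerically and compare with a Coulomb field. The sketch below uses central differences; the function names and the sample points are assumptions made for this illustration, and the Coulomb field is written in units where $q/4\pi\epsilon_0 = 1$.

```python
def curl(field, p, h=1e-5):
    """Central-difference estimate of curl(field) at the point p = (x, y, z).
    `field` takes (x, y, z) and returns a 3-tuple of components."""
    def d(comp, axis):
        plus, minus = list(p), list(p)
        plus[axis] += h
        minus[axis] -= h
        return (field(*plus)[comp] - field(*minus)[comp]) / (2.0 * h)

    return (d(2, 1) - d(1, 2),   # dEz/dy - dEy/dz
            d(0, 2) - d(2, 0),   # dEx/dz - dEz/dx
            d(1, 0) - d(0, 1))   # dEy/dx - dEx/dy


def e_candidate(x, y, z):
    """The field E = y x-hat."""
    return (y, 0.0, 0.0)


def e_coulomb(x, y, z):
    """Point-charge field at the origin, in units where q/(4 pi eps0) = 1."""
    r3 = (x * x + y * y + z * z) ** 1.5
    return (x / r3, y / r3, z / r3)


if __name__ == "__main__":
    print(curl(e_candidate, (0.3, -1.2, 2.0)))
    print(curl(e_coulomb, (1.0, 2.0, 3.0)))
```

The first field has curl $-\hat{\mathbf{z}}$ everywhere, so no arrangement of static charges can produce it; the Coulomb field’s curl vanishes to within rounding error.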



Because $\nabla \times \mathbf{E} = 0$, the line integral of E around any closed loop is zero (that follows
from Stokes’ theorem). Because $\oint \mathbf{E} \cdot d\mathbf{l} = 0$, the line integral of E from point a to point
b is the same for all paths (otherwise you could go out along path (i) and return along path
(ii), as in Fig. 2.30, and obtain $\oint \mathbf{E} \cdot d\mathbf{l} \neq 0$). Because the line integral is independent of path,
we can define a function⁴

\[ V(\mathbf{r}) \equiv -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l}. \tag{2.21} \]



Here O is some standard reference point on which we have agreed beforehand; V then 
depends only on the point r. It is called the electric potential. 

Evidently, the potential difference between two points a and b is 



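\[ V(\mathbf{b}) - V(\mathbf{a}) = -\int_{\mathcal{O}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l} + \int_{\mathcal{O}}^{\mathbf{a}} \mathbf{E} \cdot d\mathbf{l} = -\int_{\mathcal{O}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l} - \int_{\mathbf{a}}^{\mathcal{O}} \mathbf{E} \cdot d\mathbf{l} = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l}. \tag{2.22} \]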



Now, the fundamental theorem for gradients states that 



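\[ V(\mathbf{b}) - V(\mathbf{a}) = \int_{\mathbf{a}}^{\mathbf{b}} (\nabla V) \cdot d\mathbf{l}, \]

so

\[ \int_{\mathbf{a}}^{\mathbf{b}} (\nabla V) \cdot d\mathbf{l} = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l}. \]

Since, finally, this is true for any points a and b, the integrands must be equal:

\[ \mathbf{E} = -\nabla V. \tag{2.23} \]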



Equation 2.23 is the differential version of Eq. 2.21; it says that the electric field is the 
gradient of a scalar potential, which is what we set out to prove. 

4 To avoid any possible ambiguity I should perhaps put a prime on the integration variable:

\[ V(\mathbf{r}) = -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E}(\mathbf{r}') \cdot d\mathbf{l}'. \]

But this makes for cumbersome notation, and I prefer whenever possible to reserve the primes for source points.
However, when (as in Ex. 2.6) we calculate such integrals explicitly, I shall put in the primes.

Notice the subtle but crucial role played by path independence (or, equivalently, the fact
that $\nabla \times \mathbf{E} = 0$) in this argument. If the line integral of E depended on the path taken, then
the “definition” of V, Eq. 2.21, would be nonsense. It simply would not define a function,
since changing the path would alter the value of V(r). By the way, don’t let the minus sign
in Eq. 2.23 distract you; it carries over from Eq. 2.21 and is largely a matter of convention.



Problem 2.20 One of these is an impossible electrostatic field. Which one? 

(a) $\mathbf{E} = k[xy\,\hat{\mathbf{x}} + 2yz\,\hat{\mathbf{y}} + 3xz\,\hat{\mathbf{z}}]$;

(b) $\mathbf{E} = k[y^2\,\hat{\mathbf{x}} + (2xy + z^2)\,\hat{\mathbf{y}} + 2yz\,\hat{\mathbf{z}}]$.

Here k is a constant with the appropriate units. For the possible one, find the potential, using
the origin as your reference point. Check your answer by computing $\nabla V$. [Hint: You must
select a specific path to integrate along. It doesn’t matter what path you choose, since the 
answer is path-independent, but you simply cannot integrate unless you have a particular path 
in mind.] 



2.3.2 Comments on Potential 

(i) The name. The word “potential” is a hideous misnomer because it inevitably
reminds you of potential energy. This is particularly confusing, because there is a connection
between “potential” and “potential energy,” as you will see in Sect. 2.4. I’m sorry that it
is impossible to escape this word. The best I can do is to insist once and for all that
“potential” and “potential energy” are completely different terms and should, by all rights,
have different names. Incidentally, a surface over which the potential is constant is called
an equipotential.

(ii) Advantage of the potential formulation. If you know V, you can easily get
E: just take the gradient, $\mathbf{E} = -\nabla V$. This is quite extraordinary when you stop to think
about it, for E is a vector quantity (three components), but V is a scalar (one component).
How can one function possibly contain all the information that three independent functions
carry? The answer is that the three components of E are not really as independent as
they look; in fact, they are explicitly interrelated by the very condition we started with,
$\nabla \times \mathbf{E} = 0$. In terms of components,

\[ \frac{\partial E_x}{\partial y} = \frac{\partial E_y}{\partial x}, \qquad \frac{\partial E_z}{\partial y} = \frac{\partial E_y}{\partial z}, \qquad \frac{\partial E_x}{\partial z} = \frac{\partial E_z}{\partial x}. \]

This brings us back to my observation at the beginning of Sect. 2.3.1: E is a very special
kind of vector. What the potential formulation does is to exploit this feature to maximum
advantage, reducing a vector problem down to a scalar one, in which there is no need to
fuss with components.

(iii) The reference point O. There is an essential ambiguity in the definition of
potential, since the choice of reference point O was arbitrary. Changing reference points
amounts to adding a constant K to the potential:

\[ V'(\mathbf{r}) = -\int_{\mathcal{O}'}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l} = -\int_{\mathcal{O}'}^{\mathcal{O}} \mathbf{E} \cdot d\mathbf{l} - \int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l} = K + V(\mathbf{r}), \]

where K is the line integral of E from the old reference point O to the new one O'. Of
course, adding a constant to V will not affect the potential difference between two points:

\[ V'(\mathbf{b}) - V'(\mathbf{a}) = V(\mathbf{b}) - V(\mathbf{a}), \]

since the K’s cancel out. (Actually, it was already clear from Eq. 2.22 that the potential
difference is independent of O, because it can be written as the line integral of E from a to
b, with no reference to O.) Nor does the ambiguity affect the gradient of V:

\[ \nabla V' = \nabla V, \]

since the derivative of a constant is zero. That’s why all such V’s, differing only in their 
choice of reference point, correspond to the same field E. 

Evidently potential as such carries no real physical significance, for at any given point 
we can adjust its value at will by a suitable relocation of O. In this sense it is rather like 
altitude: If I ask you how high Denver is, you will probably tell me its height above sea level, 
because that is a convenient and traditional reference point. But we could as well agree 
to measure altitude above Washington D.C., or Greenwich, or wherever. That would add 
(or, rather, subtract) a fixed amount from all our sea-level readings, but it wouldn’t change 
anything about the real world. The only quantity of intrinsic interest is the difference in 
altitude between two points, and that is the same whatever your reference level. 

Having said this, however, there is a “natural” spot to use for O in electrostatics,
analogous to sea level for altitude, and that is a point infinitely far from the charge.
Ordinarily, then, we “set the zero of potential at infinity.” (Since $V(\mathcal{O}) = 0$, choosing a
reference point is equivalent to selecting a place where V is to be zero.) But I must warn
you that there is one special circumstance in which this convention fails: when the charge
distribution itself extends to infinity. The symptom of trouble, in such cases, is that the
potential blows up. For instance, the field of a uniformly charged plane is $(\sigma/2\epsilon_0)\hat{\mathbf{n}}$, as we
found in Ex. 2.4; if we naively put $\mathcal{O} = \infty$, then the potential at height z above the plane
becomes

\[ V(z) = -\int_{\infty}^{z} \frac{1}{2\epsilon_0}\,\sigma\,dz = -\frac{1}{2\epsilon_0}\,\sigma\,(z - \infty). \]

The remedy is simply to choose some other reference point (in this problem you might use
the origin). Notice that the difficulty occurs only in textbook problems; in “real life” there
is no such thing as a charge distribution that goes on forever, and we can always use infinity
as our reference point.

(iv) Potential obeys the superposition principle. The original superposition principle
of electrodynamics pertains to the force on a test charge Q. It says that the total force
on Q is the vector sum of the forces attributable to the source charges individually:

\[ \mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \dots \]

Dividing through by Q, we find that the electric field, too, obeys the superposition principle:

\[ \mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2 + \dots \]

Integrating from the common reference point to r, it follows that the potential also satisfies
such a principle:

\[ V = V_1 + V_2 + \dots \]



That is, the potential at any given point is the sum of the potentials due to all the source 
charges separately. Only this time it is an ordinary sum, not a vector sum, which makes it 
a lot easier to work with. 

(v) Units of Potential. In our units, force is measured in newtons and charge in 
coulombs, so electric fields are in newtons per coulomb. Accordingly, potential is measured 
in newton-meters per coulomb or joules per coulomb. A joule per coulomb is called a volt. 



Example 2.6 



Find the potential inside and outside a spherical shell of radius R (Fig. 2.31), which carries a 
uniform surface charge. Set the reference point at infinity. 




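Figure 2.31

Solution: From Gauss’s law, the field outside is

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}, \]

where q is the total charge on the sphere. The field inside is zero. For points outside the sphere
(r > R),

\[ V(r) = -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E} \cdot d\mathbf{l} = \frac{-1}{4\pi\epsilon_0} \int_{\infty}^{r} \frac{q}{r'^2}\,dr' = \left. \frac{1}{4\pi\epsilon_0}\,\frac{q}{r'} \right|_{\infty}^{r} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r}. \]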



To find the potential inside the sphere (r < R), we must break the integral into two sections, 
using in each region the field that prevails there: 



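\[ V(r) = \frac{-1}{4\pi\epsilon_0} \int_{\infty}^{R} \frac{q}{r'^2}\,dr' - \int_{R}^{r} (0)\,dr' = \left. \frac{1}{4\pi\epsilon_0}\,\frac{q}{r'} \right|_{\infty}^{R} + 0 = \frac{1}{4\pi\epsilon_0}\,\frac{q}{R}. \]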



Notice that the potential is not zero inside the shell, even though the field is. V is a constant
in this region, to be sure, so that $\nabla V = 0$, and that’s what matters. In problems of this type you
must always work your way in from the reference point; that’s where the potential is “nailed
down.” It is tempting to suppose that you could figure out the potential inside the sphere on
the basis of the field there alone, but this is false: The potential inside the sphere is sensitive to
what’s going on outside the sphere as well. If I placed a second uniformly charged shell out at
radius R' > R, the potential inside R would change, even though the field would still be zero.
Gauss’s law guarantees that charge exterior to a given point (that is, at larger r) produces no
net field at that point, provided it is spherically or cylindrically symmetric; but there is no such
rule for potential, when infinity is used as the reference point.
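
The “work your way in from the reference point” recipe of Ex. 2.6 translates directly into a numerical integral. In the Python sketch below (an illustration only), the infinite range is handled with the substitution u = 1/r', which is a choice made here and not part of the example; the function names are likewise assumptions.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def shell_field(q, big_r, r):
    """Radial field of a uniformly charged shell of radius big_r and total
    charge q: zero inside, Coulomb's law outside."""
    if r < big_r:
        return 0.0
    return q / (4.0 * math.pi * EPS0 * r * r)


def potential(q, big_r, r, n=100000):
    """V(r) = -int_inf^r E dr' = int_r^inf E dr'. With u = 1/r' this becomes
    the integral of E(1/u) / u**2 over 0 < u < 1/r (midpoint rule)."""
    u_max = 1.0 / r
    h = u_max / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h
        total += shell_field(q, big_r, 1.0 / u) / (u * u) * h
    return total


if __name__ == "__main__":
    q, big_r = 1e-9, 1.0
    for r in (2.0, 1.0, 0.5, 0.1):
        print(r, potential(q, big_r, r))
```

Outside the shell the numbers follow $q/4\pi\epsilon_0 r$; inside they stay pinned at the surface value $q/4\pi\epsilon_0 R$, because the integrand vanishes once the path crosses into the field-free interior.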



Problem 2.21 Find the potential inside and outside a uniformly charged solid sphere whose 
radius is R and whose total charge is q. Use infinity as your reference point. Compute the 
gradient of V in each region, and check that it yields the correct field. Sketch V (r). 

Problem 2.22 Find the potential a distance s from an infinitely long straight wire that carries
a uniform line charge $\lambda$. Compute the gradient of your potential, and check that it yields the
correct field.

Problem 2.23 For the charge configuration of Prob. 2.15, find the potential at the center, using 
infinity as your reference point. 

Problem 2.24 For the configuration of Prob. 2.16, find the potential difference between a point 
on the axis and a point on the outer cylinder. Note that it is not necessary to commit yourself 
to a particular reference point if you use Eq. 2.22.

2.3.3 Poisson’s Equation and Laplace’s Equation

We found in Sect. 2.3.1 that the electric field can be written as the gradient of a scalar
potential.

\[ \mathbf{E} = -\nabla V. \]

The question arises: What do the fundamental equations for E,

\[ \nabla \cdot \mathbf{E} = \frac{\rho}{\epsilon_0} \quad \text{and} \quad \nabla \times \mathbf{E} = 0, \]

look like, in terms of V? Well, $\nabla \cdot \mathbf{E} = \nabla \cdot (-\nabla V) = -\nabla^2 V$, so, apart from that persisting
minus sign, the divergence of E is the Laplacian of V. Gauss’s law then says that

\[ \nabla^2 V = -\frac{\rho}{\epsilon_0}. \tag{2.24} \]

This is known as Poisson’s equation. In regions where there is no charge, so that $\rho = 0$,
Poisson’s equation reduces to Laplace’s equation,

\[ \nabla^2 V = 0. \tag{2.25} \]

We’ll explore these equations more fully in Chapter 3. 

So much for Gauss’s law. What about the curl law? This says that 

\[ \nabla \times \mathbf{E} = \nabla \times (-\nabla V) \]

must equal zero. But that’s no condition on V: the curl of a gradient is always zero. Of course,
we used the curl law to show that E could be expressed as the gradient of a scalar, so it’s not
really surprising that this works out: $\nabla \times \mathbf{E} = 0$ permits $\mathbf{E} = -\nabla V$; in return, $\mathbf{E} = -\nabla V$
guarantees $\nabla \times \mathbf{E} = 0$. It takes only one differential equation (Poisson’s) to determine V,
because V is a scalar; for E we needed two, the divergence and the curl.
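
Poisson’s and Laplace’s equations can be spot-checked with a finite-difference Laplacian. The sketch below, offered as an illustration rather than a solution method, tests two potentials: 1/r (a point charge, in units where $q/4\pi\epsilon_0 = 1$), which should satisfy Laplace’s equation away from the origin, and $-r^2/6$, whose Laplacian is $-1$ and which therefore corresponds to a uniform $\rho/\epsilon_0 = 1$. Both test functions, their names, and the step size are assumptions chosen here.

```python
import math


def laplacian(f, p, h=1e-3):
    """Second-order central-difference estimate of the Laplacian of the
    scalar function f(x, y, z) at the point p."""
    x, y, z = p
    return (f(x + h, y, z) + f(x - h, y, z)
            + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h)
            - 6.0 * f(x, y, z)) / (h * h)


def v_point(x, y, z):
    """Potential of a point charge at the origin, with q/(4 pi eps0) = 1."""
    return 1.0 / math.sqrt(x * x + y * y + z * z)


def v_uniform(x, y, z):
    """A potential whose Laplacian is -1, i.e. uniform rho/eps0 = 1."""
    return -(x * x + y * y + z * z) / 6.0


if __name__ == "__main__":
    p = (1.0, 2.0, 3.0)
    print(laplacian(v_point, p), laplacian(v_uniform, p))
```

The first number is zero to within discretization and rounding error (Laplace’s equation, no charge at p); the second is $-1$, matching $-\rho/\epsilon_0$ in Eq. 2.24.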



2.3.4 The Potential of a Localized Charge Distribution 

I defined V in terms of E (Eq. 2.21). Ordinarily, though, it’s E that we’re looking for (if we 
already knew E there wouldn’t be much point in calculating V). The idea is that it might be 
easier to get V first, and then calculate E by taking the gradient. Typically, then, we know 
where the charge is (that is, we know $\rho$), and we want to find V. Now, Poisson’s equation
relates V and $\rho$, but unfortunately it’s “the wrong way around”: it would give us $\rho$, if we
knew V, whereas we want V, knowing $\rho$. What we must do, then, is “invert” Poisson’s
equation. That’s the program for this section, although I shall do it by roundabout means,
beginning, as always, with a point charge at the origin.

Figure 2.32

Setting the reference point at infinity, the potential of a point charge q at the origin is

\[ V(r) = \frac{-1}{4\pi\epsilon_0} \int_{\infty}^{r} \frac{q}{r'^2}\,dr' = \left. \frac{1}{4\pi\epsilon_0}\,\frac{q}{r'} \right|_{\infty}^{r} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r}. \]

(You see here the special virtue of using infinity for the reference point: it kills the lower
limit on the integral.) Notice the sign of V; presumably the conventional minus sign in
the definition of V (Eq. 2.21) was chosen precisely in order to make the potential of a
positive charge come out positive. It is useful to remember that regions of positive charge
are potential “hills,” regions of negative charge are potential “valleys,” and the electric field
points “downhill,” from plus toward minus.

In general, the potential of a point charge q is

\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\,\frac{q}{\mathscr{r}}, \tag{2.26} \]



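where $\mathscr{r}$, as always, is the distance from the charge to r (Fig. 2.32). Invoking the superposition
principle, then, the potential of a collection of charges is

\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \sum_{i=1}^{n} \frac{q_i}{\mathscr{r}_i}, \tag{2.27} \]

or, for a continuous distribution,

\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{1}{\mathscr{r}}\,dq. \tag{2.28} \]

In particular, for a volume charge, it’s

\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\rho(\mathbf{r}')}{\mathscr{r}}\,d\tau'. \tag{2.29} \]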



This is the equation we were looking for, telling us how to compute V when we know $\rho$; it
is, if you like, the “solution” to Poisson’s equation, for a localized charge distribution.⁵ I
invite you to compare Eq. 2.29 with the corresponding formula for the electric field in terms
of $\rho$ (Eq. 2.8):

\[ \mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,\rho(\mathbf{r}')\,d\tau'. \]

The main point to notice is that the pesky unit vector $\hat{\boldsymbol{\mathscr{r}}}$ is now missing, so there is no need
to worry about components. Incidentally, the potentials of line and surface charges are

\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\lambda(\mathbf{r}')}{\mathscr{r}}\,dl' \quad \text{and} \quad V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\sigma(\mathbf{r}')}{\mathscr{r}}\,da'. \tag{2.30} \]

5 Equation 2.29 is an example of the Helmholtz theorem (Appendix B), in the context of electrostatics, where
the curl of E is zero and its divergence is $\rho/\epsilon_0$.



I should warn you that everything in this section is predicated on the assumption that 
the reference point is at infinity. This is hardly apparent in Eq. 2.29, but remember that we 
got that equation from the potential of a point charge at the origin, $(1/4\pi\epsilon_0)(q/r)$, which
is valid only when $\mathcal{O} = \infty$. If you try to apply these formulas to one of those artificial
problems in which the charge itself extends to infinity, the integral will diverge. 
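
Equation 2.27 is a plain scalar sum, which makes it very easy to code. The Python sketch below evaluates V for a handful of point charges and then recovers E as $-\nabla V$ by central differences, comparing against the direct vector sum of Coulomb fields. The particular charges, the field point, and all function names are assumptions invented for this illustration; `math.dist` requires Python 3.8 or later.

```python
import math

EPS0 = 8.8541878128e-12          # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)


def potential(charges, r):
    """Eq. 2.27 with the reference point at infinity.
    `charges` is a list of (q, (x, y, z)) pairs; r is the field point."""
    return K * sum(q / math.dist(r, pos) for q, pos in charges)


def field_from_potential(charges, r, h=1e-5):
    """E = -grad V, estimated with central differences."""
    comps = []
    for axis in range(3):
        plus, minus = list(r), list(r)
        plus[axis] += h
        minus[axis] -= h
        dv = potential(charges, plus) - potential(charges, minus)
        comps.append(-dv / (2.0 * h))
    return tuple(comps)


def field_direct(charges, r):
    """Vector sum of the individual Coulomb fields, for comparison."""
    e = [0.0, 0.0, 0.0]
    for q, pos in charges:
        sep = [ri - pi for ri, pi in zip(r, pos)]
        s3 = math.dist(r, pos) ** 3
        for axis in range(3):
            e[axis] += K * q * sep[axis] / s3
    return tuple(e)


if __name__ == "__main__":
    charges = [(1e-9, (0.0, 0.0, 0.0)),
               (-2e-9, (1.0, 0.0, 0.0)),
               (3e-9, (0.0, 1.0, 1.0))]
    r = (0.5, -0.7, 2.0)
    print(field_from_potential(charges, r))
    print(field_direct(charges, r))
```

The potential route needs one scalar sum and no bookkeeping of directions; the components appear only at the end, when the gradient is taken.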



Example 2.7 



Find the potential of a uniformly charged spherical shell of radius R (Fig. 2.33). 



Solution: This is the same problem we solved in Ex. 2.6, but this time we shall do it using 
Eq. 2.30: 



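\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\sigma}{\mathscr{r}}\,da'. \]

Let’s set the point r on the z axis and use the law of cosines to express $\mathscr{r}$ in terms of the polar
angle $\theta'$:

\[ \mathscr{r}^2 = R^2 + z^2 - 2Rz\cos\theta'. \]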




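Figure 2.33

An element of surface area on this sphere is $R^2 \sin\theta'\,d\theta'\,d\phi'$, so

\[
\begin{aligned}
4\pi\epsilon_0 V(z) &= \sigma \int \frac{R^2 \sin\theta'\,d\theta'\,d\phi'}{\sqrt{R^2 + z^2 - 2Rz\cos\theta'}} \\
&= 2\pi R^2 \sigma \int_0^{\pi} \frac{\sin\theta'}{\sqrt{R^2 + z^2 - 2Rz\cos\theta'}}\,d\theta' \\
&= 2\pi R^2 \sigma \left. \left( \frac{1}{Rz} \sqrt{R^2 + z^2 - 2Rz\cos\theta'} \right) \right|_0^{\pi} \\
&= \frac{2\pi R \sigma}{z} \left( \sqrt{R^2 + z^2 + 2Rz} - \sqrt{R^2 + z^2 - 2Rz} \right) \\
&= \frac{2\pi R \sigma}{z} \left[ \sqrt{(R + z)^2} - \sqrt{(R - z)^2} \right].
\end{aligned}
\]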

At this stage we must be very careful to take the positive root. For points outside the sphere, z is
greater than R, and hence $\sqrt{(R - z)^2} = z - R$; for points inside the sphere, $\sqrt{(R - z)^2} = R - z$.
Thus,

\[ V(z) = \frac{R\sigma}{2\epsilon_0 z} \left[ (R + z) - (z - R) \right] = \frac{R^2 \sigma}{\epsilon_0 z}, \quad \text{outside;} \]

\[ V(z) = \frac{R\sigma}{2\epsilon_0 z} \left[ (R + z) - (R - z) \right] = \frac{R\sigma}{\epsilon_0}, \quad \text{inside.} \]

In terms of the total charge on the shell, $q = 4\pi R^2 \sigma$, $V(z) = (1/4\pi\epsilon_0)(q/z)$ (or, in general,
$V(r) = (1/4\pi\epsilon_0)(q/r)$) for points outside the sphere, and $(1/4\pi\epsilon_0)(q/R)$ for points inside.

Of course, in this particular case, it was easier to get V by using 2.21 than 2.30, because 
Gauss’s law gave us E with so little effort. But if you compare Ex. 2.7 with Prob. 2.7, you will 
appreciate the power of the potential formulation. 
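
The surface integral of Ex. 2.7 can also be done numerically, which is a useful check on the delicate choice of square root. This Python sketch evaluates the $\theta'$ integral with a midpoint rule (the $\phi'$ integral just gives $2\pi$) and compares with the closed forms. The function name, the sample values, and the integration scheme are assumptions made for this illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def shell_potential_numeric(sigma, big_r, z, n=20000):
    """Eq. 2.30 for a uniformly charged spherical shell of radius big_r,
    evaluated at a point on the z axis (z != big_r) by a midpoint rule in
    the polar angle theta'."""
    h = math.pi / n
    total = 0.0
    for i in range(n):
        th = (i + 0.5) * h
        # Law of cosines: distance from the surface element to the point.
        sep = math.sqrt(big_r * big_r + z * z - 2.0 * big_r * z * math.cos(th))
        total += math.sin(th) / sep * h
    return 2.0 * math.pi * big_r * big_r * sigma * total / (4.0 * math.pi * EPS0)


if __name__ == "__main__":
    sigma, big_r = 1e-6, 0.5
    print(shell_potential_numeric(sigma, big_r, 1.0),
          big_r * big_r * sigma / (EPS0 * 1.0))      # outside
    print(shell_potential_numeric(sigma, big_r, 0.2),
          big_r * sigma / EPS0)                      # inside
```

The numerical integral never has to decide which sign of the root to take, since the distance is positive by construction; it lands on the “outside” formula for z > R and the “inside” one for z < R automatically.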



Problem 2.25 Using Eqs. 2.27 and 2.30, find the potential at a distance z above the center of
the charge distributions in Fig. 2.34. In each case, compute $\mathbf{E} = -\nabla V$, and compare your
answers with Prob. 2.2a, Ex. 2.1, and Prob. 2.6, respectively. Suppose that we changed the
right-hand charge in Fig. 2.34a to −q; what then is the potential at P? What field does that
suggest? Compare your answer to Prob. 2.2b, and explain carefully any discrepancy.



Figure 2.34: (a) Two point charges (+q and +q, a distance d apart); (b) Uniform line charge (length 2L); (c) Uniform surface charge. In each panel the point P lies a distance z above the center.

Problem 2.26 A conical surface (an empty ice-cream cone) carries a uniform surface charge
$\sigma$. The height of the cone is h, as is the radius of the top. Find the potential difference between
points a (the vertex) and b (the center of the top).



Problem 2.27 Find the potential on the axis of a uniformly charged solid cylinder, a distance
z from the center. The length of the cylinder is L, its radius is R, and the charge density is $\rho$.
Use your result to calculate the electric field at this point. (Assume that z > L/2.)



Problem 2.28 Use Eq. 2.29 to calculate the potential inside a uniformly charged solid sphere 
of radius R and total charge q. Compare your answer to Prob. 2.21. 

Problem 2.29 Check that Eq. 2.29 satisfies Poisson’s equation, by applying the Laplacian and 
using Eq. 1.102. 



2.3.5 Summary; Electrostatic Boundary Conditions 

In the typical electrostatic problem you are given a source charge distribution $\rho$, and you
want to find the electric field E it produces. Unless the symmetry of the problem admits a
solution by Gauss’s law, it is generally to your advantage to calculate the potential first, as
an intermediate step. These, then, are the three fundamental quantities of electrostatics: $\rho$,
E, and V. We have, in the course of our discussion, derived all six formulas interrelating
them. These equations are neatly summarized in Fig. 2.35. We began with just two experimental
observations: (1) the principle of superposition, a broad general rule applying to
all electromagnetic forces, and (2) Coulomb’s law, the fundamental law of electrostatics.
From these, all else followed.




Figure 2.35

You may have noticed, in studying Exs. 2.4 and 2.5, or working problems such as 2.7,
2.11, and 2.16, that the electric field always undergoes a discontinuity when you cross a
surface charge $\sigma$. In fact, it is a simple matter to find the amount by which E changes at
such a boundary. Suppose we draw a wafer-thin Gaussian pillbox, extending just barely
over the edge in each direction (Fig. 2.36). Gauss’s law states that

\[ \oint_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}} = \frac{1}{\epsilon_0}\,\sigma A, \]

where A is the area of the pillbox lid. (If $\sigma$ varies from point to point or the surface is
curved, we must pick A to be extremely small.) Now, the sides of the pillbox contribute
nothing to the flux, in the limit as the thickness $\epsilon$ goes to zero, so we are left with

\[ E^{\perp}_{\text{above}} - E^{\perp}_{\text{below}} = \frac{1}{\epsilon_0}\,\sigma, \tag{2.31} \]

where $E^{\perp}_{\text{above}}$ denotes the component of E that is perpendicular to the surface immediately
above, and $E^{\perp}_{\text{below}}$ is the same, only just below the surface. For consistency, we let “upward”
be the positive direction for both. Conclusion: The normal component of E is discontinuous
by an amount $\sigma/\epsilon_0$ at any boundary. In particular, where there is no surface charge, $E^{\perp}$ is
continuous, as for instance at the surface of a uniformly charged solid sphere.

The tangential component of E, by contrast, is always continuous. For if we apply 
Eq. 2.19, 



\[ \oint \mathbf{E} \cdot d\mathbf{l} = 0, \]

to the thin rectangular loop of Fig. 2.37, the ends give nothing (as $\epsilon \to 0$), and the sides
give $(E^{\parallel}_{\text{above}}\,l - E^{\parallel}_{\text{below}}\,l)$, so

\[ \mathbf{E}^{\parallel}_{\text{above}} = \mathbf{E}^{\parallel}_{\text{below}}, \tag{2.32} \]

Figure 2.37

where $\mathbf{E}^{\parallel}$ stands for the components of E parallel to the surface. The boundary conditions
on E (Eqs. 2.31 and 2.32) can be combined into a single formula:

\[ \mathbf{E}_{\text{above}} - \mathbf{E}_{\text{below}} = \frac{\sigma}{\epsilon_0}\,\hat{\mathbf{n}}, \tag{2.33} \]

where $\hat{\mathbf{n}}$ is a unit vector perpendicular to the surface, pointing from “below” to “above.”⁶
The potential, meanwhile, is continuous across any boundary (Fig. 2.38), since

\[ V_{\text{above}} - V_{\text{below}} = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l}; \]

as the path length shrinks to zero, so too does the integral:

\[ V_{\text{above}} = V_{\text{below}}. \tag{2.34} \]




Figure 2.38 



⁶ Notice that it doesn’t matter which side you call “above” and which “below,” since reversal would switch the
direction of n̂. Incidentally, if you’re only interested in the field due to the (essentially flat) local patch of surface
charge itself, the answer is (σ/2ε₀)n̂ immediately above the surface, and −(σ/2ε₀)n̂ immediately below. This
follows from Ex. 2.4, for if you are close enough to the patch it “looks” like an infinite plane. Evidently the entire
discontinuity in E is attributable to this local patch of charge.
However, the gradient of V inherits the discontinuity in E; since E = −∇V, Eq. 2.33
implies that

\[ \nabla V_{\text{above}} - \nabla V_{\text{below}} = -\frac{1}{\epsilon_0} \sigma \hat{\mathbf{n}}, \tag{2.35} \]

or, more conveniently,

\[ \frac{\partial V_{\text{above}}}{\partial n} - \frac{\partial V_{\text{below}}}{\partial n} = -\frac{1}{\epsilon_0} \sigma, \tag{2.36} \]

where

\[ \frac{\partial V}{\partial n} = \nabla V \cdot \hat{\mathbf{n}} \tag{2.37} \]

denotes the normal derivative of V (that is, the rate of change in the direction perpendicular
to the surface).

Please note that these boundary conditions relate the fields and potentials just above and 
just below the surface. For example, the derivatives in Eq. 2.36 are the limiting values as 
we approach the surface from either side. 
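
As a quick numerical sanity check of Eq. 2.33, here is a minimal sketch that measures the jump in the radial field across a uniformly charged spherical shell, using the Gauss’s-law result (zero inside, Coulomb field outside). The function name `shell_field_radial` and the values chosen for q and R are illustrative assumptions, not part of the text.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def shell_field_radial(r, q, R):
    """Radial field of a uniformly charged spherical shell (from Gauss's law)."""
    if r < R:
        return 0.0  # no enclosed charge inside the shell
    return q / (4.0 * math.pi * EPS0 * r**2)


# Illustrative values (assumed): 1 nC spread over a shell of radius 5 cm.
q, R = 1e-9, 0.05
sigma = q / (4.0 * math.pi * R**2)  # surface charge density

delta = 1e-9  # tiny step off the surface on either side
jump = shell_field_radial(R + delta, q, R) - shell_field_radial(R - delta, q, R)

print("E_above - E_below =", jump)
print("sigma / eps0      =", sigma / EPS0)
```

The two printed numbers should agree to within the small error introduced by stepping a finite distance `delta` off the surface.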



Problem 2.30 

(a) Check that the results of Exs. 2.4 and 2.5, and Prob. 2.11, are consistent with Eq. 2.33. 

(b) Use Gauss’s law to find the field inside and outside a long hollow cylindrical tube, which
carries a uniform surface charge σ. Check that your result is consistent with Eq. 2.33.

(c) Check that the result of Ex. 2.7 is consistent with boundary conditions 2.34 and 2.36. 



2.4 Work and Energy in Electrostatics 

2.4.1 The Work Done to Move a Charge 

Suppose you have a stationary configuration of source charges, and you want to move a test 
charge Q from point a to point b (Fig. 2.39). Question: How much work will you have to 
do? At any point along the path, the electric force on Q is F = QE; the force you must 
exert, in opposition to this electrical force, is −QE. (If the sign bothers you, think about
lifting a brick: Gravity exerts a force mg downward, but you exert a force mg upward. Of
course, you could apply an even greater force; then the brick would accelerate, and part
of your effort would be “wasted” generating kinetic energy. What we’re interested in here
is the minimum force you must exert to do the job.) The work is therefore

\[ W = \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{F} \cdot d\mathbf{l} = -Q \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l} = Q[V(\mathbf{b}) - V(\mathbf{a})]. \]

Figure 2.39

Notice that the answer is independent of the path you take from a to b; in mechanics, then,
we would call the electrostatic force “conservative.” Dividing through by Q, we have

\[ V(\mathbf{b}) - V(\mathbf{a}) = \frac{W}{Q}. \tag{2.38} \]

In words, the potential difference between points a and b is equal to the work per unit charge 
required to carry a particle from a to b. In particular, if you want to bring the charge Q in 
from far away and stick it at point r, the work you must do is 

\[ W = Q[V(\mathbf{r}) - V(\infty)], \]

so, if you have set the reference point at infinity,

\[ W = QV(\mathbf{r}). \tag{2.39} \]

In this sense potential is potential energy (the work it takes to create the system) per unit 
charge (just as the field is the force per unit charge). 
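
The path independence claimed above can be checked numerically. The sketch below is only an illustration under assumed values: the source is a single point charge at the origin, the test charge is carried from a to b along two different polygonal paths, and −Q∫E·dl is evaluated by the midpoint rule. The helper names (`e_field`, `potential`, `work_along`), the charges, and the waypoints are all hypothetical choices.

```python
import math

EPS0 = 8.8541878128e-12            # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # Coulomb constant


def e_field(p, q_src):
    """Field at point p = (x, y, z) of a point charge q_src sitting at the origin."""
    r = math.sqrt(p[0] ** 2 + p[1] ** 2 + p[2] ** 2)
    return tuple(K * q_src * c / r**3 for c in p)


def potential(p, q_src):
    """Potential of the same point charge, with the reference point at infinity."""
    return K * q_src / math.sqrt(p[0] ** 2 + p[1] ** 2 + p[2] ** 2)


def work_along(waypoints, Q, q_src, steps=5000):
    """W = -Q * (integral of E . dl) along straight segments joining the waypoints."""
    total = 0.0
    for start, end in zip(waypoints, waypoints[1:]):
        dl = tuple((e - s) / steps for s, e in zip(start, end))
        for n in range(steps):
            mid = tuple(s + (n + 0.5) * d for s, d in zip(start, dl))  # midpoint rule
            E = e_field(mid, q_src)
            total += sum(Ec * dc for Ec, dc in zip(E, dl))
    return -Q * total


# Assumed illustrative values: 1 nC source, 1 nC test charge, distances in metres.
q_src, Q = 1e-9, 1e-9
a, b = (1.0, 0.0, 0.0), (0.0, 2.0, 0.0)
c = (1.0, 2.0, 1.0)  # an arbitrary detour point

W_direct = work_along([a, b], Q, q_src)
W_detour = work_along([a, c, b], Q, q_src)
W_exact = Q * (potential(b, q_src) - potential(a, q_src))

print(W_direct, W_detour, W_exact)
```

All three numbers should coincide up to the discretization error of the midpoint rule, whichever detour point is chosen (provided the path avoids the source charge itself).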



2.4.2 The Energy of a Point Charge Distribution 



How much work would it take to assemble an entire collection of point charges? Imagine 
bringing in the charges, one by one, from far away (Fig. 2.40). The first charge, $q_1$, takes
no work, since there is no field yet to fight against. Now bring in $q_2$. According to Eq. 2.39,
this will cost you $q_2 V_1(\mathbf{r}_2)$, where $V_1$ is the potential due to $q_1$, and $\mathbf{r}_2$ is the place we’re
putting $q_2$:

\[ W_2 = \frac{1}{4\pi\epsilon_0} q_2 \left( \frac{q_1}{𝓇_{12}} \right) \]

($𝓇_{12}$ is the distance between $q_1$ and $q_2$ once they are in position). Now bring in $q_3$; this
requires work $q_3 V_{1,2}(\mathbf{r}_3)$, where $V_{1,2}$ is the potential due to charges $q_1$ and $q_2$, namely,
$(1/4\pi\epsilon_0)(q_1/𝓇_{13} + q_2/𝓇_{23})$. Thus

\[ W_3 = \frac{1}{4\pi\epsilon_0} q_3 \left( \frac{q_1}{𝓇_{13}} + \frac{q_2}{𝓇_{23}} \right). \]

Figure 2.40

Similarly, the extra work to bring in $q_4$ will be

\[ W_4 = \frac{1}{4\pi\epsilon_0} q_4 \left( \frac{q_1}{𝓇_{14}} + \frac{q_2}{𝓇_{24}} + \frac{q_3}{𝓇_{34}} \right). \]

The total work necessary to assemble the first four charges, then, is

\[ W = \frac{1}{4\pi\epsilon_0} \left( \frac{q_1 q_2}{𝓇_{12}} + \frac{q_1 q_3}{𝓇_{13}} + \frac{q_1 q_4}{𝓇_{14}} + \frac{q_2 q_3}{𝓇_{23}} + \frac{q_2 q_4}{𝓇_{24}} + \frac{q_3 q_4}{𝓇_{34}} \right). \]

You see the general rule: Take the product of each pair of charges, divide by their separation
distance, and add it all up:

\[ W = \frac{1}{4\pi\epsilon_0} \sum_{i=1}^{n} \sum_{\substack{j=1 \\ j>i}}^{n} \frac{q_i q_j}{𝓇_{ij}}. \tag{2.40} \]

The stipulation j > i is just to remind you not to count the same pair twice. A nicer way
to accomplish the same purpose is intentionally to count each pair twice, and then divide
by 2:

\[ W = \frac{1}{8\pi\epsilon_0} \sum_{i=1}^{n} \sum_{\substack{j=1 \\ j \neq i}}^{n} \frac{q_i q_j}{𝓇_{ij}} \tag{2.41} \]

(we must still avoid i = j, of course). Notice that in this form the answer plainly does not
depend on the order in which you assemble the charges, since every pair occurs in the sum.
Let me next pull out the factor $q_i$:

\[ W = \frac{1}{2} \sum_{i=1}^{n} q_i \left( \sum_{\substack{j=1 \\ j \neq i}}^{n} \frac{1}{4\pi\epsilon_0} \frac{q_j}{𝓇_{ij}} \right). \]

The term in parentheses is the potential at point $\mathbf{r}_i$ (the position of $q_i$) due to all the other
charges (all of them, now, not just the ones that were present at some stage in the building-up
process). Thus,

\[ W = \frac{1}{2} \sum_{i=1}^{n} q_i V(\mathbf{r}_i). \tag{2.42} \]

That’s how much work it takes to assemble a configuration of point charges; it’s also the
amount of work you’d get back out if you dismantled the system. In the meantime, it
represents energy stored in the configuration (“potential” energy, if you like, though for
obvious reasons I prefer to avoid that word in this context). 
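
To make the bookkeeping concrete, the following sketch evaluates the assembly energy of a few point charges two ways: as the sum over distinct pairs (Eq. 2.40) and as half the sum of each charge times the potential of all the others (Eq. 2.42). The function names and the three-charge configuration are assumptions made purely for illustration.

```python
import math
from itertools import combinations

EPS0 = 8.8541878128e-12            # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # Coulomb constant


def energy_pairs(charges, positions):
    """Eq. 2.40: sum over distinct pairs of q_i q_j / (separation)."""
    return K * sum(
        charges[i] * charges[j] / math.dist(positions[i], positions[j])
        for i, j in combinations(range(len(charges)), 2)
    )


def energy_from_potentials(charges, positions):
    """Eq. 2.42: (1/2) sum of q_i V(r_i), with V due to all the *other* charges."""
    total = 0.0
    for i, (qi, ri) in enumerate(zip(charges, positions)):
        V_i = K * sum(
            qj / math.dist(ri, rj)
            for j, (qj, rj) in enumerate(zip(charges, positions))
            if j != i
        )
        total += 0.5 * qi * V_i
    return total


# Hypothetical configuration (charges in coulombs, positions in metres).
charges = [1e-9, -2e-9, 3e-9]
positions = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.2, 0.0)]

W_pairs = energy_pairs(charges, positions)
W_pot = energy_from_potentials(charges, positions)
# Listing the charges in a different order must not change the answer.
W_reordered = energy_pairs(charges[::-1], positions[::-1])

print(W_pairs, W_pot, W_reordered)
```

The three values agree, which is the numerical counterpart of the remark that the result does not depend on the order of assembly.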



Problem 2.31 

(a) Three charges are situated at the corners of a square (side a), as shown in Fig. 2.41. How
much work does it take to bring in another charge, +q, from far away and place it in the fourth
corner?

(b) How much work does it take to assemble the whole configuration of four charges? 






Figure 2.41



2.4.3 The Energy of a Continuous Charge Distribution 

For a volume charge density ρ, Eq. 2.42 becomes

\[ W = \frac{1}{2} \int \rho V \, d\tau. \tag{2.43} \]

(The corresponding integrals for line and surface charges would be ∫λV dl and ∫σV da,
respectively.) There is a lovely way to rewrite this result, in which ρ and V are eliminated
in favor of E. First use Gauss’s law to express ρ in terms of E:

\[ \rho = \epsilon_0 \nabla \cdot \mathbf{E}, \quad \text{so} \quad W = \frac{\epsilon_0}{2} \int (\nabla \cdot \mathbf{E}) V \, d\tau. \]

Now use integration by parts (Eq. 1.59) to transfer the derivative from E to V:

\[ W = \frac{\epsilon_0}{2} \left[ -\int \mathbf{E} \cdot (\nabla V) \, d\tau + \oint V \mathbf{E} \cdot d\mathbf{a} \right]. \]

But ∇V = −E, so

\[ W = \frac{\epsilon_0}{2} \left( \int_{\mathcal{V}} E^2 \, d\tau + \oint_{\mathcal{S}} V \mathbf{E} \cdot d\mathbf{a} \right). \tag{2.44} \]

But what volume is this we’re integrating over? Let’s go back to the formula we started
with, Eq. 2.43. From its derivation, it is clear that we should integrate over the region
where the charge is located. But actually, any larger volume would do just as well: The
“extra” territory we throw in will contribute nothing to the integral anyway, since ρ = 0
out there. With this in mind, let’s return to Eq. 2.44. What happens here, as we enlarge the
volume beyond the minimum necessary to trap all the charge? Well, the integral of E² can
only increase (the integrand being positive); evidently the surface integral must decrease
correspondingly to leave the sum intact. In fact, at large distances from the charge, E goes
like 1/r² and V like 1/r, while the surface area grows like r². Roughly speaking, then,
the surface integral goes down like 1/r. Please understand that Eq. 2.44 gives you the
correct energy W, whatever volume you use (as long as it encloses all the charge), but the
contribution from the volume integral goes up, and that of the surface integral goes down,
as you take larger and larger volumes. In particular, why not integrate over all space? Then
the surface integral goes to zero, and we are left with

\[ W = \frac{\epsilon_0}{2} \int_{\text{all space}} E^2 \, d\tau. \tag{2.45} \]



Example 2.8 

Find the energy of a uniformly charged spherical shell of total charge q and radius R. 



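Solution 1: Use Eq. 2.43, in the version appropriate to surface charges:

\[ W = \frac{1}{2} \int \sigma V \, da. \]

Now, the potential at the surface of this sphere is (1/4πε₀)q/R (a constant), so

\[ W = \frac{1}{8\pi\epsilon_0} \frac{q}{R} \int \sigma \, da = \frac{1}{8\pi\epsilon_0} \frac{q^2}{R}. \]

Solution 2: Use Eq. 2.45. Inside the sphere E = 0; outside,

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0} \frac{q}{r^2} \hat{\mathbf{r}}, \quad \text{so} \quad E^2 = \frac{q^2}{(4\pi\epsilon_0)^2 r^4}. \]

Therefore,

\[ W = \frac{\epsilon_0}{2(4\pi\epsilon_0)^2} \int_{\text{outside}} \left( \frac{q^2}{r^4} \right) (r^2 \sin\theta \, dr \, d\theta \, d\phi) = \frac{1}{32\pi^2\epsilon_0} q^2 \, 4\pi \int_R^{\infty} \frac{1}{r^2} \, dr = \frac{1}{8\pi\epsilon_0} \frac{q^2}{R}. \]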
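
The agreement between the two solutions can also be confirmed numerically. The sketch below is a rough check under assumed values of q and R: it integrates the energy density (ε₀/2)E² over all space outside the shell, using the substitution u = 1/r to map the infinite range onto a finite one, and compares the result with (1/2)qV(R) from Solution 1. The function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12            # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # Coulomb constant


def field_energy_outside(E_radial, R, steps=10000):
    """(eps0/2) * integral of E^2 over all space with r > R, for a spherically
    symmetric field. The substitution u = 1/r maps r in [R, inf) to u in (0, 1/R]."""
    du = (1.0 / R) / steps
    total = 0.0
    for n in range(steps):
        u = (n + 0.5) * du          # midpoint rule never touches u = 0
        r = 1.0 / u
        shell_volume = 4.0 * math.pi * r**2 * (du / u**2)   # 4 pi r^2 dr, with dr = du / u^2
        total += 0.5 * EPS0 * E_radial(r) ** 2 * shell_volume
    return total


# Assumed illustrative values: 1 nC on a shell of radius 5 cm.
q, R = 1e-9, 0.05

W_field = field_energy_outside(lambda r: K * q / r**2, R)   # Solution 2 (Eq. 2.45)
W_charge = 0.5 * q * (K * q / R)                            # Solution 1 (Eq. 2.43)

print(W_field, W_charge)
```

Both routes should return the same number, q²/(8πε₀R), even though one integral runs over the charged surface and the other over the empty space outside it.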
Problem 2.32 Find the energy stored in a uniformly charged solid sphere of radius R and
charge q. Do it three different ways:

(a) Use Eq. 2.43. You found the potential in Prob. 2.21.

(b) Use Eq. 2.45. Don’t forget to integrate over all space.

(c) Use Eq. 2.44. Take a spherical volume of radius a. Notice what happens as a → ∞.

Problem 2.33 Here is a fourth way of computing the energy of a uniformly charged sphere:
Assemble the sphere layer by layer, each time bringing in an infinitesimal charge dq from far
away and smearing it uniformly over the surface, thereby increasing the radius. How much
work dW does it take to build up the radius by an amount dr? Integrate this to find the work
necessary to create the entire sphere of radius R and total charge q.



2.4.4 Comments on Electrostatic Energy 

(i) A perplexing “inconsistency.” Equation 2.45 clearly implies that the energy of a 
stationary charge distribution is always positive. On the other hand, Eq. 2.42 (from which 
2.45 was in fact derived), can be positive or negative. For instance, according to 2.42, the 
energy of two equal but opposite charges a distance 𝓇 apart would be −(1/4πε₀)(q²/𝓇).
What’s gone wrong? Which equation is correct? 

The answer is that both equations are correct, but they pertain to slightly different 
situations. Equation 2.42 does not take into account the work necessary to make the point 
charges in the first place; we started with point charges and simply found the work required 
to bring them together. This is wise policy, since Eq. 2.45 indicates that the energy of a 
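point charge is in fact infinite:

\[ W = \frac{\epsilon_0}{2(4\pi\epsilon_0)^2} \int \left( \frac{q^2}{r^4} \right) (r^2 \sin\theta \, dr \, d\theta \, d\phi) = \frac{q^2}{8\pi\epsilon_0} \int_0^{\infty} \frac{1}{r^2} \, dr = \infty. \]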

Equation 2.45 is more complete, in the sense that it tells you the total energy stored in 
a charge configuration, but Eq. 2.42 is more appropriate when you’re dealing with point 
charges, because we prefer (for good reason!) to leave out that portion of the total energy 
that is attributable to the fabrication of the point charges themselves. In practice, after 
all, the point charges (electrons, say) are given to us ready-made; all we do is move them 
around. Since we did not put them together, and we cannot take them apart, it is immaterial 
how much work the process would involve. (Still, the infinite energy of a point charge 
is a recurring source of embarrassment for electromagnetic theory, afflicting the quantum 
version as well as the classical. We shall return to the problem in Chapter 11.) 
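
To get a feeling for how the divergence sets in, one can cut the radial integral off at a small radius a instead of 0; the energy stored in the field beyond a is then q²/(8πε₀a), the same expression found for the shell in Ex. 2.8. The sketch below simply tabulates this for an elementary charge; the cutoff radii are arbitrary illustrative choices and carry no physical significance.

```python
import math

EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19   # elementary charge, C


def field_energy_beyond(q, a):
    """Energy in the field of a point charge q outside radius a:
    (q^2 / 8 pi eps0) * integral from a to infinity of dr / r^2 = q^2 / (8 pi eps0 a)."""
    return q**2 / (8.0 * math.pi * EPS0 * a)


# Arbitrary cutoff radii in metres, shrinking toward zero.
for a in (1e-10, 1e-15, 1e-20):
    W = field_energy_beyond(E_CHARGE, a)
    print(f"a = {a:.0e} m  ->  W = {W:.3e} J")
```

Each tenfold reduction in the cutoff multiplies the energy by ten, so no finite value is reached as a → 0.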

Now, you may wonder where the inconsistency crept into an apparently water-tight 
derivation. The “flaw” lies between Eqs. 2.42 and 2.43: In the former, $V(\mathbf{r}_i)$ represents
the potential due to all the other charges but not $q_i$, whereas in the latter, V(r) is the full
potential. For a continuous distribution there is no distinction, since the amount of charge
right at the point r is vanishingly small, and its contribution to the potential is zero.
(ii) Where is the energy stored? Equations 2.43 and 2.45 offer two different ways of
calculating the same thing. The first is an integral over the charge distribution; the second 
is an integral over the field. These can involve completely different regions. For instance, 
in the case of the spherical shell (Ex. 2.8) the charge is confined to the surface, whereas the 
electric field is present everywhere outside this surface. Where is the energy, then? Is it 
stored in the field, as Eq. 2.45 seems to suggest, or is it stored in the charge, as Eq. 2.43 
implies? At the present level, this is simply an unanswerable question: I can tell you what 
the total energy is, and I can provide you with several different ways to compute it, but it is 
unnecessary to worry about where the energy is located. In the context of radiation theory 
(Chapter 11) it is useful (and in General Relativity it is essential) to regard the energy as 
being stored in the field, with a density 



\[ \frac{\epsilon_0}{2} E^2 = \text{energy per unit volume}. \tag{2.46} \]

But in electrostatics one could just as well say it is stored in the charge, with a density ½ρV.
The difference is purely a matter of bookkeeping. 

(iii) The superposition principle. Because electrostatic energy is quadratic in the 
fields, it does not obey a superposition principle. The energy of a compound system is not 
the sum of the energies of its parts considered separately — there are also “cross terms”: 

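\[ \begin{aligned}
W_{\text{tot}} &= \frac{\epsilon_0}{2} \int E^2 \, d\tau = \frac{\epsilon_0}{2} \int (\mathbf{E}_1 + \mathbf{E}_2)^2 \, d\tau \\
&= \frac{\epsilon_0}{2} \int (E_1^2 + E_2^2 + 2\mathbf{E}_1 \cdot \mathbf{E}_2) \, d\tau \\
&= W_1 + W_2 + \epsilon_0 \int \mathbf{E}_1 \cdot \mathbf{E}_2 \, d\tau.
\end{aligned} \tag{2.47} \]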

For example, if you double the charge everywhere, you quadruple the total energy. 
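
A small numerical illustration, using point charges rather than fields: for two groups of point charges the counterpart of the cross term in Eq. 2.47 is the interaction energy between the groups (one term for each pair with one member in each group), with self-energies left out as in Eq. 2.42. The sketch below checks that the total is W₁ + W₂ plus that cross term, and that doubling every charge quadruples the energy. The two groups and the helper names are hypothetical.

```python
import math
from itertools import combinations

EPS0 = 8.8541878128e-12            # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # Coulomb constant


def pair_energy(charges, positions):
    """Eq. 2.40: assembly energy of a set of point charges."""
    return K * sum(
        charges[i] * charges[j] / math.dist(positions[i], positions[j])
        for i, j in combinations(range(len(charges)), 2)
    )


def cross_energy(qA, rA, qB, rB):
    """Interaction energy between two groups: one term per (A, B) pair."""
    return K * sum(
        qa * qb / math.dist(pa, pb)
        for qa, pa in zip(qA, rA)
        for qb, pb in zip(qB, rB)
    )


# Two hypothetical subsystems (coulombs, metres).
qA, rA = [1e-9, -2e-9], [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0)]
qB, rB = [3e-9, 1e-9], [(0.0, 0.2, 0.0), (0.1, 0.2, 0.1)]

W_A = pair_energy(qA, rA)
W_B = pair_energy(qB, rB)
W_cross = cross_energy(qA, rA, qB, rB)
W_total = pair_energy(qA + qB, rA + rB)
W_doubled = pair_energy([2 * q for q in qA + qB], rA + rB)

print("W_total          =", W_total)
print("W_A + W_B        =", W_A + W_B)
print("... + cross term =", W_A + W_B + W_cross)
print("W_doubled / W    =", W_doubled / W_total)
```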



Problem 2.34 Consider two concentric spherical shells, of radii a and b. Suppose the inner
one carries a charge q, and the outer one a charge −q (both of them uniformly distributed
over the surface). Calculate the energy of this configuration, (a) using Eq. 2.45, and (b) using 
Eq. 2.47 and the results of Ex. 2.8. 



2.5 Conductors 

2.5.1 Basic Properties 

In an insulator, such as glass or rubber, each electron is attached to a particular atom. In a 
metallic conductor, by contrast, one or more electrons per atom are free to roam about at will 
through the material. (In liquid conductors such as salt water it is ions that do the moving.) 
A perfect conductor would be a material containing an unlimited supply of completely free 
charges. In real life there are no perfect conductors, but many substances come amazingly
close. From this definition the basic electrostatic properties of ideal conductors immediately 
follow: 

(i) E = 0 inside a conductor. Why? Because if there were any field, those free charges 
would move, and it wouldn’t be electrostatics any more. Well . . . that’s hardly a satisfactory
explanation; maybe all it proves is that you can’t have electrostatics when conductors are
present. We had better examine what happens when you put a conductor into an external
electric field E₀ (Fig. 2.42). Initially, this will drive any free positive charges to the right,
and negative ones to the left. (In practice it’s only the negative charges, the electrons, that
do the moving, but when they depart the right side is left with a net positive charge, the
stationary nuclei, so it doesn’t really matter which charges move; the effect is the same.)
When they come to the edge of the material, the charges pile up: plus on the right side,
minus on the left. Now, these induced charges produce a field of their own, E₁, which, as
you can see from the figure, is in the opposite direction to E₀. That’s the crucial point, for
it means that the field of the induced charges tends to cancel off the original field. Charge
will continue to flow until this cancellation is complete, and the resultant field inside the
conductor is precisely zero.⁷ The whole process is practically instantaneous.

Figure 2.42




(ii) ρ = 0 inside a conductor. This follows from Gauss’s law: ∇ · E = ρ/ε₀. If E = 0,
so also is ρ. There is still charge around, but exactly as much plus charge as minus, so the
net charge density in the interior is zero.

(iii) Any net charge resides on the surface. That’s the only other place it can be.

(iv) A conductor is an equipotential. For if a and b are any two points within (or at the
surface of) a given conductor, $V(\mathbf{b}) - V(\mathbf{a}) = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E} \cdot d\mathbf{l} = 0$, and hence V(a) = V(b).

⁷ Outside the conductor the field is not zero, for here E₀ and E₁ do not cancel.

Figure 2.43



(v) E is perpendicular to the surface, just outside a conductor. Otherwise, as in (i), 
charge will immediately flow around the surface until it kills off the tangential component
(Fig. 2.43). (Perpendicular to the surface, charge cannot flow, of course, since it is confined
to the conducting object.) 

I think it is strange that the charge on a conductor flows to the surface. Because of their 
mutual repulsion, the charges naturally spread out as much as possible, but for all of them 
to go to the surface seems like a waste of the interior space. Surely we could do better, from 
the point of view of making each charge as far as possible from its neighbors, to sprinkle 
some of them throughout the volume. . . Well, it simply is not so. You do best to put all the 
charge on the surface, and this is true regardless of the size or shape of the conductor.⁸

The problem can also be phrased in terms of energy. Like any other free dynamical 
system, the charge on a conductor will seek the configuration that minimizes its potential 
energy. What property (iii) asserts is that the electrostatic energy of a solid object (with 
specified shape and total charge) is a minimum when that charge is spread over the surface. 
For instance, the energy of a sphere is (1/8πε₀)(q²/R) if the charge is uniformly distributed
over the surface, as we found in Ex. 2.8, but it is greater, (3/20πε₀)(q²/R), if the charge is
uniformly distributed throughout the volume (Prob. 2.32). 
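
To see the energy argument at work, consider a one-parameter family of charge distributions on a sphere (an assumption introduced here for illustration, not a general proof): a fraction f of the total charge q is spread uniformly over the surface and the remaining 1 − f uniformly through the volume. Only the volume part produces a field inside, so Eq. 2.45 gives W(f) = (q²/8πε₀R)[1 + (1 − f)²/5]. The sketch below evaluates this and confirms that f = 1 (all charge on the surface) gives the smallest energy within this family.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def sphere_energy(q, R, f_surface):
    """Field energy of a sphere carrying total charge q, with fraction f_surface
    on the surface and the rest uniformly distributed through the volume."""
    k = 1.0 / (4.0 * math.pi * EPS0)
    q_vol = (1.0 - f_surface) * q
    # r < R: E = k q_vol r / R^3, so the integral of E^2 4 pi r^2 dr is 4 pi (k q_vol)^2 / (5 R)
    inside = 0.5 * EPS0 * 4.0 * math.pi * (k * q_vol) ** 2 / (5.0 * R)
    # r > R: E = k q / r^2, so the integral of E^2 4 pi r^2 dr is 4 pi (k q)^2 / R
    outside = 0.5 * EPS0 * 4.0 * math.pi * (k * q) ** 2 / R
    return inside + outside


q, R = 1e-9, 0.05  # assumed illustrative values

W_surface = sphere_energy(q, R, 1.0)   # all charge on the surface
W_volume = sphere_energy(q, R, 0.0)    # all charge in the volume

print("surface:", W_surface)
print("volume :", W_volume)
print("ratio  :", W_volume / W_surface)
```

The two end points reproduce the values quoted above, and their ratio is 6/5.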

2.5.2 Induced Charges 

If you hold a charge +q near an uncharged conductor (Fig. 2.44), the two will attract one 
another. The reason for this is that q will pull minus charges over to the near side and repel 
plus charges to the far side. (Another way to think of it is that the charge moves around in 
such a way as to cancel off the field of q for points inside the conductor, where the total 
field must be zero.) Since the negative induced charge is closer to q, there is a net force of
attraction. (In Chapter 3 we shall calculate this force explicitly, for the case of a spherical 
conductor.) 



⁸ By the way, the one- and two-dimensional analogs are quite different: The charge on a conducting disk does
not all go to the perimeter (R. Friedberg, Am. J. of Phys. 61, 1084 (1993)), nor does the charge on a conducting
needle go to the ends (D. J. Griffiths and Y. Li, Am. J. of Phys. 64, 706 (1996)). See Prob. 2.52.

Figure 2.44



Figure 2.45 



By the way, when I speak of the field, charge, or potential “inside” a conductor, I 
mean in the “meat” of the conductor; if there is some cavity in the conductor, and within 
that cavity there is some charge, then the field in the cavity will not be zero. But in a 
remarkable way the cavity and its contents are electrically isolated from the outside world 
by the surrounding conductor (Fig. 2.45). No external fields penetrate the conductor; they 
are canceled at the outer surface by the induced charge there. Similarly, the field due to 
charges within the cavity is killed off, for all exterior points, by the induced charge on the 
inner surface. (However, the compensating charge left over on the outer surface of the 
conductor effectively “communicates” the presence of q to the outside world, as we shall 
see in Ex. 2.9.) Incidentally, the total charge induced on the cavity wall is equal and opposite 
to the charge inside, for if we surround the cavity with a Gaussian surface, all points of 
which are in the conductor (Fig. 2.45), ∮ E · da = 0, and hence (by Gauss’s law) the net
enclosed charge must be zero. But $Q_{\text{enc}} = q + q_{\text{induced}}$, so $q_{\text{induced}} = -q$.

Example 2.9 

An uncharged spherical conductor centered at the origin has a cavity of some weird shape 
carved out of it (Fig. 2.46). Somewhere within the cavity is a charge q. Question: What is the 
field outside the sphere? 




Figure 2.46 
Solution: At first glance it would appear that the answer depends on the shape of the cavity
and on the placement of the charge. But that’s wrong: The answer is

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0} \frac{q}{r^2} \hat{\mathbf{r}} \]

regardless. The conductor conceals from us all information concerning the nature of the cavity,
revealing only the total charge it contains. How can this be? Well, the charge +q induces
an opposite charge −q on the wall of the cavity, which distributes itself in such a way that
its field cancels that of q, for all points exterior to the cavity. Since the conductor carries no
net charge, this leaves +q to distribute itself uniformly over the surface of the sphere. (It’s
uniform because the asymmetrical influence of the point charge +q is negated by that of the
induced charge −q on the inner surface.) For points outside the sphere, then, the only thing
that survives is the field of the leftover +q, uniformly distributed over the outer surface.

It may occur to you that in one respect this argument is open to challenge: There are actually 
three fields at work here, $\mathbf{E}_q$, $\mathbf{E}_{\text{induced}}$, and $\mathbf{E}_{\text{leftover}}$. All we know for certain is that the sum
of the three is zero inside the conductor, yet I claimed that the first two alone cancel, while 
the third is separately zero there. Moreover, even if the first two cancel within the conductor, 
who is to say they still cancel for points outside? They do not, after all, cancel for points 
inside the cavity. I cannot give you a completely satisfactory answer at the moment, but this 
much at least is true: There exists a way of distributing −q over the inner surface so as to
cancel the field of q at all exterior points. For that same cavity could have been carved out of 
a huge spherical conductor with a radius of 27 miles or light years or whatever. In that case 
the leftover +q on the outer surface is simply too far away to produce a significant field, and 
the other two fields would have to accomplish the cancellation by themselves. So we know 
they can do it . . . but are we sure they choose to? Perhaps for small spheres nature prefers 
some complicated three-way cancellation. Nope: As we’ll see in the uniqueness theorems of 
Chapter 3, electrostatics is very stingy with its options; there is always precisely one way — no 
more — of distributing the charge on a conductor so as to make the field inside zero. Having 
found a possible way, we are guaranteed that no alternative exists even in principle. 



If a cavity surrounded by conducting material is itself empty of charge, then the field
within the cavity is zero. For any field line would have to begin and end on the cavity wall,
going from a plus charge to a minus charge (Fig. 2.47). Letting that field line be part of a
closed loop, the rest of which is entirely inside the conductor (where E = 0), the integral
∮ E · dl is distinctly positive, in violation of Eq. 2.19. It follows that E = 0 within an empty
cavity, and there is in fact no charge on the surface of the cavity. (This is why you are
relatively safe inside a metal car during a thunderstorm: you may get cooked, if lightning
strikes, but you will not be electrocuted. The same principle applies to the placement of
sensitive apparatus inside a grounded Faraday cage, to shield out stray electric fields. In
practice, the enclosure doesn’t even have to be solid conductor; chicken wire will often
suffice.)

Figure 2.47



Problem 2.35 A metal sphere of radius R, carrying charge q, is surrounded by a thick concentric 
metal shell (inner radius a, outer radius b, as in Fig. 2.48). The shell carries no net charge. 

(a) Find the surface charge density σ at R, at a, and at b.

(b) Find the potential at the center, using infinity as the reference point. 

(c) Now the outer surface is touched to a grounding wire, which lowers its potential to zero 
(same as at infinity). How do your answers to (a) and (b) change? 



Problem 2.36 Two spherical cavities, of radii a and b, are hollowed out from the interior of a 
(neutral) conducting sphere of radius R (Fig. 2.49). At the center of each cavity a point charge 
is placed; call these charges $q_a$ and $q_b$.

(a) Find the surface charges $\sigma_a$, $\sigma_b$, and $\sigma_R$.

(b) What is the field outside the conductor?

(c) What is the field within each cavity?

(d) What is the force on $q_a$ and $q_b$?

(e) Which of these answers would change if a third charge, $q_c$, were brought near the conductor?





Figure 2.48 



Figure 2.49 
2.5.3 Surface Charge and the Force on a Conductor

Because the field inside a conductor is zero, boundary condition 2.33 requires that the field
immediately outside is

\[ \mathbf{E} = \frac{\sigma}{\epsilon_0} \hat{\mathbf{n}}, \tag{2.48} \]

consistent with our earlier conclusion that the field is normal to the surface. In terms of
potential, Eq. 2.36 yields

\[ \sigma = -\epsilon_0 \frac{\partial V}{\partial n}. \tag{2.49} \]

These equations enable you to calculate the surface charge on a conductor, if you can 
determine E or V ; we shall use them frequently in the next chapter. 

In the presence of an electric field, a surface charge will, naturally, experience a force; the
force per unit area, f, is σE. But there’s a problem here, for the electric field is discontinuous
at a surface charge, so which value are we supposed to use: $\mathbf{E}_{\text{above}}$, $\mathbf{E}_{\text{below}}$, or something in
between? The answer is that we should use the average of the two:

\[ \mathbf{f} = \sigma \mathbf{E}_{\text{average}} = \frac{1}{2} \sigma (\mathbf{E}_{\text{above}} + \mathbf{E}_{\text{below}}). \tag{2.50} \]

Why the average? The reason is very simple, though the telling makes it sound complicated:
Let’s focus our attention on a small patch of surface surrounding the point in question
(Fig. 2.50). Make it tiny enough so it is essentially flat and the surface charge on it is
essentially constant. The total field consists of two parts: that attributable to the patch
itself, and that due to everything else (other regions of the surface, as well as any external
sources that may be present):

\[ \mathbf{E} = \mathbf{E}_{\text{patch}} + \mathbf{E}_{\text{other}}. \]

Now, the patch cannot exert a force on itself, any more than you can lift yourself by standing
in a basket and pulling up on the handles. The force on the patch, then, is due exclusively
to $\mathbf{E}_{\text{other}}$, and this suffers no discontinuity (if we removed the patch, the field in the “hole”
would be perfectly smooth). The discontinuity is due entirely to the charge on the patch,
which puts out a field (σ/2ε₀) on either side, pointing away from the surface (Fig. 2.50).
Thus,

\[ \mathbf{E}_{\text{above}} = \mathbf{E}_{\text{other}} + \frac{\sigma}{2\epsilon_0} \hat{\mathbf{n}}, \qquad \mathbf{E}_{\text{below}} = \mathbf{E}_{\text{other}} - \frac{\sigma}{2\epsilon_0} \hat{\mathbf{n}}, \]

and hence

\[ \mathbf{E}_{\text{other}} = \frac{1}{2} (\mathbf{E}_{\text{above}} + \mathbf{E}_{\text{below}}) = \mathbf{E}_{\text{average}}. \]

Averaging is really just a device for removing the contribution of the patch itself.

Figure 2.50

That argument applies to any surface charge; in the particular case of a conductor, the
field is zero inside and (σ/ε₀)n̂ outside (Eq. 2.48), so the average is (σ/2ε₀)n̂, and the force
per unit area is

\[ \mathbf{f} = \frac{1}{2\epsilon_0} \sigma^2 \hat{\mathbf{n}}. \tag{2.51} \]

This amounts to an outward electrostatic pressure on the surface, tending to draw the
conductor into the field, regardless of the sign of σ. Expressing the pressure in terms of the
field just outside the surface,

\[ P = \frac{\epsilon_0}{2} E^2. \tag{2.52} \]
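
For a sense of scale, the sketch below evaluates the electrostatic pressure on a charged conducting sphere in both forms, σ²/2ε₀ (the magnitude of Eq. 2.51) and ε₀E²/2 (Eq. 2.52), and checks that they agree and that the sign of σ is irrelevant. The charge and radius are illustrative assumptions, and the function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def pressure_from_sigma(sigma):
    """Magnitude of the force per unit area, Eq. 2.51: sigma^2 / (2 eps0)."""
    return sigma**2 / (2.0 * EPS0)


def pressure_from_field(E_outside):
    """Eq. 2.52: P = (eps0 / 2) E^2, with E the field just outside the surface."""
    return 0.5 * EPS0 * E_outside**2


# Assumed illustrative values: 1 microcoulomb on a conducting sphere of radius 10 cm.
Q, R = 1e-6, 0.1
sigma = Q / (4.0 * math.pi * R**2)   # uniform surface charge density
E_out = sigma / EPS0                 # Eq. 2.48

P_sigma = pressure_from_sigma(sigma)
P_field = pressure_from_field(E_out)

print(f"sigma = {sigma:.3e} C/m^2, E = {E_out:.3e} V/m")
print(f"P = {P_sigma:.3e} Pa (from sigma), {P_field:.3e} Pa (from E)")
```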



Problem 2.37 Two large metal plates (each of area A) are held a distance d apart. Suppose 
we put a charge Q on each plate; what is the electrostatic pressure on the plates? 

Problem 2.38 A metal sphere of radius R carries a total charge Q. What is the force of 
repulsion between the “northern” hemisphere and the “southern” hemisphere? 



2.5.4 Capacitors 

Suppose we have two conductors, and we put charge +Q on one and −Q on the other
(Fig. 2.51). Since V is constant over a conductor, we can speak unambiguously of the
potential difference between them:

\[ V = V_+ - V_- = -\int_{(-)}^{(+)} \mathbf{E} \cdot d\mathbf{l}. \]

We don’t know how the charge distributes itself over the two conductors, and calculating
the field would be a mess, if their shapes are complicated, but this much we do know: E is
proportional to Q. For E is given by Coulomb’s law:

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0} \int \frac{\rho}{𝓇^2} \hat{𝓇} \, d\tau, \]

so if you double ρ, you double E. (Wait a minute! How do we know that doubling Q (and
also −Q) simply doubles ρ? Maybe the charge moves around into a completely different
configuration, quadrupling ρ in some places and halving it in others, just so the total charge
on each conductor is doubled. The fact is that this concern is unwarranted: doubling Q
does double ρ everywhere; it doesn’t shift the charge around. The proof of this will come
in Chapter 3; for now you’ll just have to believe me.)

Figure 2.51

Since E is proportional to Q, so also is V. The constant of proportionality is called the
capacitance of the arrangement:

\[ C \equiv \frac{Q}{V}. \tag{2.53} \]

Capacitance is a purely geometrical quantity, determined by the sizes, shapes, and separation
of the two conductors. In SI units, C is measured in farads (F); a farad is a coulomb-per-volt.
Actually, this turns out to be inconveniently large;⁹ more practical units are the microfarad
(10⁻⁶ F) and the picofarad (10⁻¹² F).

Notice that V is, by definition, the potential of the positive conductor less that of the 
negative one; likewise, Q is the charge of the positive conductor. Accordingly, capacitance 
is an intrinsically positive quantity. (By the way, you will occasionally hear someone 
speak of the capacitance of a single conductor. In this case the “second conductor,” with 
the negative charge, is an imaginary spherical shell of infinite radius surrounding the one 
conductor. It contributes nothing to the field, so the capacitance is given by Eq. 2.53, where 
V is the potential with infinity as the reference point.) 



Example 2.10 

Find the capacitance of a “parallel-plate capacitor” consisting of two metal surfaces of area A 
held a distance d apart (Fig. 2.52). 



Figure 2.52

⁹ In the second edition I claimed you would need a forklift to carry a 1 F capacitor. This is no longer the
case: you can now buy a 1 F capacitor that fits comfortably in a soup spoon.
Solution: If we put +Q on the top and −Q on the bottom, they will spread out uniformly over
the two surfaces, provided the area is reasonably large and the separation distance small.¹⁰
The surface charge density, then, is σ = Q/A on the top plate, and so the field, according to
Ex. 2.5, is (1/ε₀)Q/A. The potential difference between the plates is therefore

\[ V = \frac{Q}{A\epsilon_0} d, \]

and hence

\[ C = \frac{A\epsilon_0}{d}. \tag{2.54} \]

If, for instance, the plates are square with sides 1 cm long, and they are held 1 mm apart, then
the capacitance is 9 × 10⁻¹³ F.
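
The number quoted above is easy to reproduce. The following sketch evaluates Eq. 2.54 for the 1 cm square plates held 1 mm apart; the function name `parallel_plate_capacitance` is a hypothetical helper, and edge effects are ignored exactly as in the example.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def parallel_plate_capacitance(area, separation):
    """Eq. 2.54: C = eps0 * A / d (SI units, fringing fields neglected)."""
    return EPS0 * area / separation


# Square plates with 1 cm sides, 1 mm apart, as in the example.
C = parallel_plate_capacitance(area=0.01 * 0.01, separation=1e-3)
print(f"C = {C:.2e} F")   # prints: C = 8.85e-13 F
```

Rounded to one significant figure this is the 9 × 10⁻¹³ F given in the text, a little under one picofarad.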



Example 2.11 

Find the capacitance of two concentric spherical metal shells, with radii a and b. 



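Solution: Place charge +Q on the inner sphere, and −Q on the outer one. The field between
the spheres is

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0} \frac{Q}{r^2} \hat{\mathbf{r}}, \]

so the potential difference between them is

\[ V = -\int_b^a \mathbf{E} \cdot d\mathbf{l} = -\frac{Q}{4\pi\epsilon_0} \int_b^a \frac{1}{r^2} \, dr = \frac{Q}{4\pi\epsilon_0} \left( \frac{1}{a} - \frac{1}{b} \right). \]

As promised, V is proportional to Q; the capacitance is

\[ C = \frac{Q}{V} = 4\pi\epsilon_0 \frac{ab}{(b - a)}. \]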
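
As a check on this result, the sketch below computes the potential difference by integrating the field numerically between the shells and compares Q/V with the closed form. It also looks at a thin gap, where one would expect the answer to approach the parallel-plate formula with A = 4πa² and d = b − a (that comparison is my own illustrative addition). The radii, charge, and function names are assumptions.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def spherical_capacitance(a, b):
    """Closed form from the example: C = 4 pi eps0 a b / (b - a)."""
    return 4.0 * math.pi * EPS0 * a * b / (b - a)


def potential_difference_numeric(Q, a, b, steps=20000):
    """V = integral from a to b of E dr (midpoint rule), E = Q / (4 pi eps0 r^2)."""
    dr = (b - a) / steps
    return sum(
        Q / (4.0 * math.pi * EPS0 * (a + (n + 0.5) * dr) ** 2) * dr
        for n in range(steps)
    )


# Assumed illustrative values: shells of radius 10 cm and 20 cm, Q = 1 nC.
Q, a, b = 1e-9, 0.10, 0.20
C_formula = spherical_capacitance(a, b)
C_numeric = Q / potential_difference_numeric(Q, a, b)
print(C_formula, C_numeric)

# Thin-gap limit: compare with eps0 * A / d for A = 4 pi a^2, d = b - a.
a_thin, b_thin = 0.10, 0.1001
ratio = spherical_capacitance(a_thin, b_thin) / (
    EPS0 * 4.0 * math.pi * a_thin**2 / (b_thin - a_thin)
)
print("thin-gap ratio:", ratio)
```

The thin-gap ratio works out to b/a, which tends to 1 as the gap closes.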




To “charge up” a capacitor, you have to remove electrons from the positive plate and 
carry them to the negative plate. In doing so you fight against the electric field, which is 
pulling them back toward the positive conductor and pushing them away from the negative 
one. How much work does it take, then, to charge the capacitor up to a final amount Q?
Suppose that at some intermediate stage in the process the charge on the positive plate is 
q, so that the potential difference is q/C. According to Eq. 2.38, the work you must do to 
transport the next piece of charge, dq, is 



$$dW = \left(\frac{q}{C}\right) dq.$$



10 The exact solution is not easy, even for the simpler case of circular plates. See G. T. Carlson and B. L. Illman,
Am. J. Phys. 62, 1099 (1994).

Problem 2.43 Find the net force that the southern hemisphere of a uniformly charged sphere
exerts on the northern hemisphere. Express your answer in terms of the radius $R$ and the total
charge $Q$. [Answer: $(1/4\pi\epsilon_0)(3Q^2/16R^2)$]

Problem 2.44 An inverted hemispherical bowl of radius $R$ carries a uniform surface charge
density $\sigma$. Find the potential difference between the “north pole” and the center. [Answer:
$(R\sigma/2\epsilon_0)(\sqrt{2} - 1)$]

Problem 2.45 A sphere of radius $R$ carries a charge density $\rho(r) = kr$ (where $k$ is a constant).
Find the energy of the configuration. Check your answer by calculating it in at least two
different ways. [Answer: $\pi k^2 R^7/7\epsilon_0$]

Problem 2.46 The electric potential of some configuration is given by the expression

$$V(\mathbf{r}) = A\,\frac{e^{-\lambda r}}{r},$$

where $A$ and $\lambda$ are constants. Find the electric field $\mathbf{E}(\mathbf{r})$, the charge density $\rho(r)$, and the total
charge $Q$. [Answer: $\rho = \epsilon_0 A\,(4\pi\delta^3(\mathbf{r}) - \lambda^2 e^{-\lambda r}/r)$]

Problem 2.47 Two infinitely long wires running parallel to the $x$ axis carry uniform charge
densities $+\lambda$ and $-\lambda$ (Fig. 2.54).

(a) Find the potential at any point $(x, y, z)$, using the origin as your reference.

(b) Show that the equipotential surfaces are circular cylinders, and locate the axis and radius
of the cylinder corresponding to a given potential $V_0$.



Problem 2.48 In a vacuum diode, electrons are “boiled” off a hot cathode, at potential zero,
and accelerated across a gap to the anode, which is held at positive potential $V_0$. The cloud of
moving electrons within the gap (called space charge) quickly builds up to the point where it
reduces the field at the surface of the cathode to zero. From then on a steady current $I$ flows
between the plates.

Suppose the plates are large relative to the separation ($A \gg d^2$ in Fig. 2.55), so that edge
effects can be neglected. Then $V$, $\rho$, and $v$ (the speed of the electrons) are all functions of $x$
alone.




Figure 2.54

Figure 2.55

(a) Write Poisson’s equation for the region between the plates.

(b) Assuming the electrons start from rest at the cathode, what is their speed at point $x$, where
the potential is $V(x)$?

(c) In the steady state, $I$ is independent of $x$. What, then, is the relation between $\rho$ and $v$?

(d) Use these three results to obtain a differential equation for $V$, by eliminating $\rho$ and $v$.

(e) Solve this equation for $V$ as a function of $x$, $V_0$, and $d$. Plot $V(x)$, and compare it to the
potential without space-charge. Also, find $\rho$ and $v$ as functions of $x$.

(f) Show that

$$I = K V_0^{3/2}, \tag{2.56}$$

and find the constant $K$. (Equation 2.56 is called the Child-Langmuir law. It holds for other
geometries as well, whenever space-charge limits the current. Notice that the space-charge
limited diode is nonlinear: it does not obey Ohm’s law.)



Problem 2.49 Imagine that new and extraordinarily precise measurements have revealed an
error in Coulomb’s law. The actual force of interaction between two point charges is found to
be

$$\mathbf{F} = \frac{1}{4\pi\epsilon_0}\frac{q_1 q_2}{𝓇^2}\left(1 + \frac{𝓇}{\lambda}\right)e^{-𝓇/\lambda}\,\hat{\boldsymbol{𝓇}},$$

where $\lambda$ is a new constant of nature (it has dimensions of length, obviously, and is a huge
number, say half the radius of the known universe, so that the correction is small, which is
why no one ever noticed the discrepancy before). You are charged with the task of reformulating
electrostatics to accommodate the new discovery. Assume the principle of superposition still
holds.



(a) What is the electric field of a charge distribution $\rho$ (replacing Eq. 2.8)?

(b) Does this electric field admit a scalar potential? Explain briefly how you reached your
conclusion. (No formal proof necessary, just a persuasive argument.)

(c) Find the potential of a point charge $q$, the analog to Eq. 2.26. (If your answer to (b) was
“no,” better go back and change it!) Use $\infty$ as your reference point.

(d) For a point charge $q$ at the origin, show that

$$\oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} + \frac{1}{\lambda^2}\int_{\mathcal{V}} V\,d\tau = \frac{1}{\epsilon_0}\,q,$$

where $\mathcal{S}$ is the surface, $\mathcal{V}$ the volume, of any sphere centered at $q$.

(e) Show that this result generalizes:

$$\oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} + \frac{1}{\lambda^2}\int_{\mathcal{V}} V\,d\tau = \frac{1}{\epsilon_0}\,Q_{\text{enc}},$$

for any charge distribution. (This is the next best thing to Gauss’s Law, in the new
“electrostatics.”)

(f) Draw the triangle diagram (like Fig. 2.35) for this world, putting in all the appropriate
formulas. (Think of Poisson’s equation as the formula for $\rho$ in terms of $V$, and Gauss’s law
(differential form) as an equation for $\rho$ in terms of $\mathbf{E}$.)

Problem 2.50 Suppose an electric field $\mathbf{E}(x, y, z)$ has the form

$$E_x = ax, \quad E_y = 0, \quad E_z = 0,$$

where $a$ is a constant. What is the charge density? How do you account for the fact that the
field points in a particular direction, when the charge density is uniform? [This is a more subtle
problem than it looks, and worthy of careful thought.]

Problem 2.51 All of electrostatics follows from the $1/r^2$ character of Coulomb’s law, together
with the principle of superposition. An analogous theory can therefore be constructed for
Newton’s law of universal gravitation. What is the gravitational energy of a sphere, of mass $M$
and radius $R$, assuming the density is uniform? Use your result to estimate the gravitational
energy of the sun (look up the relevant numbers). The sun radiates at a rate of $3.86 \times 10^{26}$ W;
if all this came from stored gravitational energy, how long would the sun last? [The sun is in
fact much older than that, so evidently this is not the source of its power.]



Problem 2.52 We know that the charge on a conductor goes to the surface, but just how it
distributes itself there is not easy to determine. One famous example in which the surface
charge density can be calculated explicitly is the ellipsoid:

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1.$$

In this case$^{11}$

$$\sigma = \frac{Q}{4\pi abc}\left(\frac{x^2}{a^4} + \frac{y^2}{b^4} + \frac{z^2}{c^4}\right)^{-1/2}, \tag{2.57}$$

where $Q$ is the total charge. By choosing appropriate values for $a$, $b$, and $c$, obtain (from
Eq. 2.57): (a) the net (both sides) surface charge density $\sigma(r)$ on a circular disk of radius $R$;
(b) the net surface charge density $\sigma(x)$ on an infinite conducting “ribbon” in the $xy$ plane,
which straddles the $y$ axis from $x = -a$ to $x = a$ (let $\Lambda$ be the total charge per unit length
of ribbon); (c) the net charge per unit length $\lambda(x)$ on a conducting “needle,” running from
$x = -a$ to $x = a$. In each case, sketch the graph of your result.



11 For the derivation (which is a real tour de force) see W. R. Smythe, Static and Dynamic Electricity, 3rd ed.
(New York: Hemisphere, 1989), Sect. 5.02.





Chapter 3 

Special Techniques 



3.1 Laplace’s Equation 

3.1.1 Introduction 

The primary task of electrostatics is to find the electric field of a given stationary charge 
distribution. In principle, this purpose is accomplished by Coulomb’s law, in the form of 
Eq. 2.8: 

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{\hat{\boldsymbol{𝓇}}}{𝓇^2}\,\rho(\mathbf{r}')\,d\tau'. \tag{3.1}$$

Unfortunately, integrals of this type can be difficult to calculate for any but the simplest
charge configurations. Occasionally we can get around this by exploiting symmetry and
using Gauss’s law, but ordinarily the best strategy is first to calculate the potential, $V$, which
is given by the somewhat more tractable Eq. 2.29:

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{1}{𝓇}\,\rho(\mathbf{r}')\,d\tau'. \tag{3.2}$$

Still, even this integral is often too tough to handle analytically. Moreover, in problems
involving conductors $\rho$ itself may not be known in advance: since charge is free to move
around, the only thing we control directly is the total charge (or perhaps the potential) of
each conductor.

In such cases it is fruitful to recast the problem in differential form, using Poisson’s
equation (2.24),

$$\nabla^2 V = -\frac{1}{\epsilon_0}\,\rho, \tag{3.3}$$

which, together with appropriate boundary conditions, is equivalent to Eq. 3.2. Very often,
in fact, we are interested in finding the potential in a region where $\rho = 0$. (If $\rho = 0$
everywhere, of course, then $V = 0$, and there is nothing further to say; that’s not what I
mean. There may be plenty of charge elsewhere, but we’re confining our attention to places
where there is no charge.) In this case Poisson’s equation reduces to Laplace’s equation:

$$\nabla^2 V = 0, \tag{3.4}$$



or, written out in Cartesian coordinates, 



$$\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} + \frac{\partial^2 V}{\partial z^2} = 0. \tag{3.5}$$



This formula is so fundamental to the subject that one might almost say electrostatics is 
the study of Laplace’s equation. At the same time, it is a ubiquitous equation, appearing in 
such diverse branches of physics as gravitation and magnetism, the theory of heat, and the 
study of soap bubbles. In mathematics it plays a major role in analytic function theory. To 
get a feel for Laplace’s equation and its solutions (which are called harmonic functions), 
we shall begin with the one- and two-dimensional versions, which are easier to picture 
and illustrate all the essential properties of the three-dimensional case (though the one- 
dimensional example lacks the richness of the other two). 



3.1.2 Laplace’s Equation in One Dimension 

Suppose V depends on only one variable, x. Then Laplace’s equation becomes 



$$\frac{d^2 V}{dx^2} = 0.$$

The general solution is

$$V(x) = mx + b, \tag{3.6}$$

the equation for a straight line. It contains two undetermined constants ($m$ and $b$), as
is appropriate for a second-order (ordinary) differential equation. They are fixed, in any
particular case, by the boundary conditions of that problem. For instance, it might be
specified that $V = 4$ at $x = 1$, and $V = 0$ at $x = 5$. In that case $m = -1$ and $b = 5$, so
$V = -x + 5$ (see Fig. 3.1).
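The step from two boundary values to the constants $m$ and $b$ is just a pair of linear equations. As a minimal sketch (the helper name `line_through` is made up for this illustration), the boundary conditions quoted above can be turned into the solution like this:

```python
def line_through(x1, v1, x2, v2):
    """Return (m, b) such that V(x) = m*x + b takes the value v1 at x1 and v2 at x2."""
    m = (v2 - v1) / (x2 - x1)   # slope fixed by the two boundary values
    b = v1 - m * x1             # intercept follows from either boundary point
    return m, b

# Boundary conditions from the text: V = 4 at x = 1 and V = 0 at x = 5
m, b = line_through(1, 4, 5, 0)
print(m, b)  # -1.0 5.0
```

Any other pair of boundary values at two distinct points fixes the line in the same way.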




Figure 3.1

I want to call your attention to two features of this result; they may seem silly and
obvious in one dimension, where I can write down the general solution explicitly, but the
analogs in two and three dimensions are powerful and by no means obvious:

1. $V(x)$ is the average of $V(x + a)$ and $V(x - a)$, for any $a$:

$$V(x) = \tfrac{1}{2}\left[V(x + a) + V(x - a)\right].$$

Laplace’s equation is a kind of averaging instruction; it tells you to assign to the point 
x the average of the values to the left and to the right of x. Solutions to Laplace’s 
equation are, in this sense, as boring as they could possibly be, and yet fit the end 
points properly. 

2. Laplace’s equation tolerates no local maxima or minima; extreme values of V must 
occur at the end points. Actually, this is a consequence of (1), for if there were a local 
maximum, V at that point would be greater than on either side, and therefore could 
not be the average. (Ordinarily, you expect the second derivative to be negative at 
a maximum and positive at a minimum. Since Laplace’s equation requires, on the 
contrary, that the second derivative be zero, it seems reasonable that solutions should 
exhibit no extrema. However, this is not a proof, since there exist functions that have 
maxima and minima at points where the second derivative vanishes: $x^4$, for example,
has such a minimum at the point x = 0.) 
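Both features are easy to test numerically. The following sketch uses the straight-line solution $V = -x + 5$ from the example above, checks the averaging property at an arbitrarily chosen point, and then shows that $x^4$, whose second derivative vanishes at $x = 0$, fails the same test there. The function names and sample points are illustrative choices.

```python
def average_of_neighbors(f, x, a):
    """Mean of f at the two points a distance a to the left and right of x."""
    return 0.5 * (f(x + a) + f(x - a))

def V(x):
    """Straight-line solution of the one-dimensional Laplace equation from the example."""
    return -x + 5.0

def quartic(x):
    """Not a solution: its second derivative, 12 x^2, vanishes only at x = 0."""
    return x ** 4

# For the straight line the average equals the central value for every a tried
for a in (0.1, 1.0, 2.5):
    print(a, average_of_neighbors(V, 3.0, a), V(3.0))

# For x^4 the neighbors lie above the value at x = 0, so 0 is a local minimum
print(average_of_neighbors(quartic, 0.0, 0.5), quartic(0.0))  # 0.0625 0.0
```

The quartic satisfies $d^2V/dx^2 = 0$ at a single point only, which is why it can have a minimum there without contradicting property 2.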



3.1.3 Laplace’s Equation in Two Dimensions 

If V depends on two variables, Laplace’s equation becomes 

$$\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} = 0.$$

This is no longer an ordinary differential equation (that is, one involving ordinary derivatives 
only); it is a partial differential equation. As a consequence, some of the simple rules you 
may be familiar with do not apply. For instance, the general solution to this equation doesn’t 
contain just two arbitrary constants — or, for that matter, any finite number — despite the fact 
that it’s a second-order equation. Indeed, one cannot write down a “general solution” (at 
least, not in a closed form like Eq. 3.6). Nevertheless, it is possible to deduce certain 
properties common to all solutions. 

It may help to have a physical example in mind. Picture a thin rubber sheet (or a soap 
film) stretched over some support. For definiteness, suppose you take a cardboard box, cut 
a wavy line all the way around, and remove the top part (Fig. 3.2). Now glue a tightly 
stretched rubber membrane over the box, so that it fits like a drum head (it won’t be a flat 
drumhead, of course, unless you chose to cut the edges off straight). Now, if you lay out 
coordinates $(x, y)$ on the bottom of the box, the height $V(x, y)$ of the sheet above the point
$(x, y)$ will satisfy Laplace’s equation.$^1$ (The one-dimensional analog would be a rubber
band stretched between two points. Of course, it would form a straight line.)

Figure 3.2

Harmonic functions in two dimensions have the same properties we noted in one dimension:

1. The value of $V$ at a point $(x, y)$ is the average of those around the point. More
precisely, if you draw a circle of any radius $R$ about the point $(x, y)$, the average
value of $V$ on the circle is equal to the value at the center:

$$V(x, y) = \frac{1}{2\pi R}\oint_{\text{circle}} V\,dl.$$



(This, incidentally, suggests the method of relaxation on which computer solutions 
to Laplace’s equation are based: Starting with specified values for V at the boundary, 
and reasonable guesses for V on a grid of interior points, the first pass reassigns to 
each point the average of its nearest neighbors. The second pass repeats the process, 
using the corrected values, and so on. After a few iterations, the numbers begin to 
settle down, so that subsequent passes produce negligible changes, and a numerical 
solution to Laplace’s equation, with the given boundary values, has been achieved.) 2 

2. $V$ has no local maxima or minima; all extrema occur at the boundaries. (As before,
this follows from (1).) Again, Laplace’s equation picks the most featureless function
possible, consistent with the boundary conditions: no hills, no valleys, just the
smoothest surface available. For instance, if you put a ping-pong ball on the stretched
rubber sheet of Fig. 3.2, it will roll over to one side and fall off; it will not find a
“pocket” somewhere to settle into, for Laplace’s equation allows no such dents in
the surface. From a geometrical point of view, just as a straight line is the shortest
distance between two points, so a harmonic function in two dimensions minimizes
the surface area spanning the given boundary line.

1 Actually, the equation satisfied by a rubber sheet is

$$\frac{\partial}{\partial x}\left(g\,\frac{\partial V}{\partial x}\right) + \frac{\partial}{\partial y}\left(g\,\frac{\partial V}{\partial y}\right) = 0, \quad\text{where}\quad g = \left[1 + \left(\frac{\partial V}{\partial x}\right)^2 + \left(\frac{\partial V}{\partial y}\right)^2\right]^{-1/2};$$

it reduces (approximately) to Laplace’s equation as long as the surface does not deviate too radically from a plane.

2 See, for example, E. M. Purcell, Electricity and Magnetism, 2nd ed., problem 3.30 (p. 119) (New York:
McGraw-Hill, 1985).
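The method of relaxation mentioned under property 1 can be sketched in a few lines of Python. This is a minimal illustration, not a production solver: the grid size, the tolerance, and the choice of boundary data (taken from $x^2 - y^2$, a function known to satisfy Laplace’s equation) are all assumptions made so that the relaxed answer can be compared with an exact one.

```python
def relax(grid, tolerance=1e-8, max_passes=5000):
    """Repeatedly replace each interior point by the average of its four nearest
    neighbors, holding the boundary fixed, until a pass changes nothing by more
    than `tolerance`. Returns the relaxed grid and the number of passes used."""
    rows, cols = len(grid), len(grid[0])
    for passes in range(1, max_passes + 1):
        updated = [row[:] for row in grid]   # each pass uses only the previous values
        largest_change = 0.0
        for i in range(1, rows - 1):
            for j in range(1, cols - 1):
                updated[i][j] = 0.25 * (grid[i - 1][j] + grid[i + 1][j]
                                        + grid[i][j - 1] + grid[i][j + 1])
                largest_change = max(largest_change, abs(updated[i][j] - grid[i][j]))
        grid = updated
        if largest_change < tolerance:
            break
    return grid, passes

def exact(i, j):
    """Harmonic test function x^2 - y^2, with x = j and y = i in grid units."""
    return float(j * j - i * i)

n = 11
# Boundary values come from the test function; interior guesses start at zero
start = [[exact(i, j) if i in (0, n - 1) or j in (0, n - 1) else 0.0
          for j in range(n)] for i in range(n)]
solution, passes = relax(start)
worst_error = max(abs(solution[i][j] - exact(i, j))
                  for i in range(n) for j in range(n))
print(passes, worst_error)
```

Each pass here works from the values of the previous pass; updating the grid in place instead is a common variant. The number of passes required grows with the number of grid points, so fine grids can take many more passes than coarse ones.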

3.1.4 Laplace’s Equation in Three Dimensions 

In three dimensions I can neither provide you with an explicit solution (as in one dimension)
nor offer a suggestive physical example to guide your intuition (as I did in two dimensions).
Nevertheless, the same two properties remain true, and this time I will sketch a proof.

1. The value of $V$ at point $\mathbf{r}$ is the average value of $V$ over a spherical surface of radius
$R$ centered at $\mathbf{r}$:

$$V(\mathbf{r}) = \frac{1}{4\pi R^2}\oint_{\text{sphere}} V\,da.$$

2. As a consequence, V can have no local maxima or minima; the extreme values of V 
must occur at the boundaries. (For if V had a local maximum at r, then by the very 
nature of maximum I could draw a sphere around r over which all values of V — and 
a fortiori the average — would be less than at r.) 

Proof: Let’s begin by calculating the average potential over a spherical surface
of radius $R$ due to a single point charge $q$ located outside the sphere. We may
as well center the sphere at the origin and choose coordinates so that $q$ lies on
the $z$-axis (Fig. 3.3). The potential at a point on the surface is

$$V = \frac{1}{4\pi\epsilon_0}\frac{q}{𝓇},$$

where

$$𝓇^2 = z^2 + R^2 - 2zR\cos\theta,$$

so

$$\begin{aligned}
V_{\text{ave}} &= \frac{1}{4\pi R^2}\,\frac{q}{4\pi\epsilon_0}\int \left[z^2 + R^2 - 2zR\cos\theta\right]^{-1/2} R^2 \sin\theta\,d\theta\,d\phi \\
&= \frac{q}{4\pi\epsilon_0}\,\frac{1}{2zR}\,\sqrt{z^2 + R^2 - 2zR\cos\theta}\;\Big|_0^{\pi} \\
&= \frac{q}{4\pi\epsilon_0}\,\frac{1}{2zR}\left[(z + R) - (z - R)\right] = \frac{1}{4\pi\epsilon_0}\frac{q}{z}.
\end{aligned}$$

But this is precisely the potential due to $q$ at the center of the sphere! By the
superposition principle, the same goes for any collection of charges outside the
sphere: their average potential over the sphere is equal to the net potential they
produce at the center. qed
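The integral in this proof can also be checked numerically. The sketch below works in units where $q/4\pi\epsilon_0 = 1$, carries out the $\phi$ integral analytically (it contributes $2\pi$), and applies a midpoint rule in $\theta$. The function name, the step count, and the sample values of $z$ are illustrative assumptions; only points outside the sphere ($z > R$) are tried.

```python
import math

def average_potential_on_sphere(z, R, n_theta=2000):
    """Average of 1/script-r over a sphere of radius R centered at the origin, for a
    unit point source on the z axis at distance z, in units with q/(4 pi eps0) = 1."""
    d_theta = math.pi / n_theta
    total = 0.0
    for k in range(n_theta):
        theta = (k + 0.5) * d_theta                      # midpoint of the k-th strip
        separation = math.sqrt(z * z + R * R - 2 * z * R * math.cos(theta))
        total += math.sin(theta) / separation * d_theta
    # (1 / 4 pi R^2) * (2 pi R^2) * integral over theta = integral / 2
    return 0.5 * total

R = 1.0
for z in (1.5, 3.0, 10.0):   # source outside the sphere in every case
    print(z, average_potential_on_sphere(z, R), 1.0 / z)
```

In each case the average over the sphere agrees with $1/z$, the potential at the center, to within the accuracy of the numerical integration.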
Problem 3.1 Find the average potential over a spherical surface of radius $R$ due to a point
charge $q$ located inside (same as above, in other words, only with $z < R$). (In this case, of
course, Laplace’s equation does not hold within the sphere.) Show that, in general,

$$V_{\text{ave}} = V_{\text{center}} + \frac{Q_{\text{enc}}}{4\pi\epsilon_0 R},$$

where $V_{\text{center}}$ is the potential at the center due to all the external charges, and $Q_{\text{enc}}$ is the total
enclosed charge.



Problem 3.2 In one sentence, justify Earnshaw’s Theorem: A charged particle cannot be
held in a stable equilibrium by electrostatic forces alone. As an example, consider the cubical
arrangement of fixed charges in Fig. 3.4. It looks, offhand, as though a positive charge at
the center would be suspended in midair, since it is repelled away from each corner. Where
is the leak in this “electrostatic bottle”? [To harness nuclear fusion as a practical energy
source it is necessary to heat a plasma (soup of charged particles) to fantastic temperatures, so
hot that contact would vaporize any ordinary pot. Earnshaw’s theorem says that electrostatic
containment is also out of the question. Fortunately, it is possible to confine a hot plasma
magnetically.]




Figure 3.4

Problem 3.3 Find the general solution to Laplace’s equation in spherical coordinates, for the
case where $V$ depends only on $r$. Do the same for cylindrical coordinates, assuming $V$ depends
only on $s$.



3.1.5 Boundary Conditions and Uniqueness Theorems 

Laplace’s equation does not by itself determine V; in addition, a suitable set of boundary 
conditions must be supplied. This raises a delicate question: What are appropriate boundary 
conditions, sufficient to determine the answer and yet not so strong as to generate
inconsistencies? The one-dimensional case is easy, for here the general solution $V = mx + b$
contains two arbitrary constants, and we therefore require two boundary conditions. We 
might, for instance, specify the value of the function at the two ends, or we might give the 
value of the function and its derivative at one end, or the value at one end and the derivative 
at the other, and so on. But we cannot get away with just the value or just the derivative at 
one end — this is insufficient information. Nor would it do to specify the derivatives at both 
ends — this would either be redundant (if the two are equal) or inconsistent (if they are not). 

In two or three dimensions we are confronted by a partial differential equation, and it 
is not so easy to see what would constitute acceptable boundary conditions. Is the shape 
of a taut rubber membrane, for instance, uniquely determined by the frame over which it 
is stretched, or, like a canning jar lid, can it snap from one stable configuration to another? 
The answer, as I think your intuition would suggest, is that V is uniquely determined by 
its value at the boundary (canning jars evidently don’t obey Laplace’s equation). However, 
other boundary conditions can also be used (see Prob. 3.4). The proof that a proposed set of 
boundary conditions will suffice is usually presented in the form of a uniqueness theorem. 
There are many such theorems for electrostatics, all sharing the same basic format — I’ll 
show you the two most useful ones. 3 

First uniqueness theorem: The solution to Laplace’s equation in some volume
$\mathcal{V}$ is uniquely determined if $V$ is specified on the boundary surface $\mathcal{S}$.

Proof: In Fig. 3.5 I have drawn such a region and its boundary. (There could
also be “islands” inside, so long as $V$ is given on all their surfaces; also, the
outer boundary could be at infinity, where $V$ is ordinarily taken to be zero.)
Suppose there were two solutions to Laplace’s equation:

$$\nabla^2 V_1 = 0 \quad\text{and}\quad \nabla^2 V_2 = 0,$$

both of which assume the specified value on the surface. I want to prove that
they must be equal. The trick is to look at their difference:

$$V_3 \equiv V_1 - V_2.$$



3 I do not intend to prove the existence of solutions here; that’s a much more difficult job. In context, the
existence is generally clear on physical grounds.

Figure 3.5 (labels: “$V$ specified on this surface ($\mathcal{S}$)”; “$V$ wanted in this volume ($\mathcal{V}$)”)



This obeys Laplace’s equation,

$$\nabla^2 V_3 = \nabla^2 V_1 - \nabla^2 V_2 = 0,$$

and it takes the value zero on all boundaries (since $V_1$ and $V_2$ are equal there).
But Laplace’s equation allows no local maxima or minima; all extrema occur
on the boundaries. So the maximum and minimum of $V_3$ are both zero.
Therefore $V_3$ must be zero everywhere, and hence

$$V_1 = V_2. \quad\text{qed}$$



Example 3.1 



Show that the potential is constant inside an enclosure completely surrounded by conducting 
material, provided there is no charge within the enclosure. 

Solution: The potential on the cavity wall is some constant, $V_0$ (that’s item (iv), in Sect. 2.5.1),
so the potential inside is a function that satisfies Laplace’s equation and has the constant value
$V_0$ at the boundary. It doesn’t take a genius to think of one solution to this problem: $V = V_0$
everywhere. The uniqueness theorem guarantees that this is the only solution. (It follows that
the field inside an empty cavity is zero, the same result we found in Sect. 2.5.2 on rather
different grounds.)



The uniqueness theorem is a license to your imagination. It doesn’t matter how you 
come by your solution; if (a) it satisfies Laplace’s equation and (b) it has the correct value 
on the boundaries, then it’s right. You’ll see the power of this argument when we come to 
the method of images. 

Incidentally, it is easy to improve on the first uniqueness theorem: I assumed there was 
no charge inside the region in question, so the potential obeyed Laplace’s equation, but 





